Vanishing/Exploding gradients C2W1

Rajendra_Ambati1 · January 2, 2023, 6:12am

what is vanishing/exploding gradients and how to mitigate those??

Christian_Simonis · January 2, 2023, 6:35am

Hi there

Vanishing gradients occur when the gradients of the parameters of a DNN become so small, that the model learns only very slowly and it seems „nothing“ is happening.

Exploding gradients is describing the opposite situation when the gradients are getting super large, causing e.g. numerical issues.

You can mitigate e.g. w/ the use of activation functions like ReLU, see also this thread:

Activation functions - #2 by Christian_Simonis

Further best practices for mitigation include weight initialisation, weight decay and batch normalization to stabilise the activation. It’s also possible to clip the weights w/ bounded optimization or reduce the learning rate if you see gradients exploding. It makes also sense to monitor your gradient flow, see also this thread!

If you want to read more also with respect to additional mitigation techniques, feel free to take a look at this Source.

Best
Christian

paulinpaloalto · January 2, 2023, 4:49pm

In addition to Christian’s excellent explanations, note that Prof Ng discusses those topics at several points in the C2 lectures. Have you gotten to the lecture “Vanishing / Exploding Gradients” in C2 Week 1 yet?

Topic		Replies	Views
So, what is vanishing/exploding gradient? Improving Deep Neural Networks: Hyperparameter tun coursera-platform	7	955	August 19, 2023
W1 assignment_Initialization Improving Deep Neural Networks: Hyperparameter tun coursera-platform	5	460	July 24, 2023
Vanishing/Exploding Gradient Improving Deep Neural Networks: Hyperparameter tun coursera-platform	2	553	June 8, 2022
Vanishing / Exploding Gradients : week1 Improving Deep Neural Networks: Hyperparameter tun coursera-platform	3	575	June 19, 2021
Vanishing / Exploding Gradients Improving Deep Neural Networks: Hyperparameter tun week-module-1 , coursera-platform	5	568	January 11, 2024

Vanishing/Exploding gradients C2W1

Related topics