# Activation function Lecture Video 4:13

Dear Mentor,

We cannot understand the intuition behind why gradient descent slows down when the slope is close to zero. Usually, a slope of zero means gradient descent has converged to the global minimum, right? So how does gradient descent slow down when the slope of the function is close to zero? Can you please help us understand this?

Now, one of the downsides of both the sigmoid function and the tanh function is that if z is either very large or very small, then the gradient or the derivative or the slope of this function becomes very small. So if z is very large or z is very small, the slope of the function ends up being close to 0. And so this can slow down gradient descent.
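To see this numerically, here is a small sketch (not from the course) that evaluates the sigmoid's derivative, sigmoid'(z) = sigmoid(z) * (1 - sigmoid(z)), at a few values of z. The derivative is largest at z = 0 and collapses toward zero as |z| grows:

```python
import numpy as np

def sigmoid(z):
    """Sigmoid activation: 1 / (1 + e^(-z))."""
    return 1.0 / (1.0 + np.exp(-z))

def sigmoid_prime(z):
    """Derivative of the sigmoid: s * (1 - s)."""
    s = sigmoid(z)
    return s * (1.0 - s)

# The slope shrinks rapidly as z moves away from 0.
for z in [0.0, 2.0, 5.0, 10.0]:
    print(f"z = {z:5.1f}   sigmoid'(z) = {sigmoid_prime(z):.6f}")
```

At z = 0 the slope is 0.25, but at z = 10 it is already below 0.0001, so any gradient flowing through that unit is scaled almost to zero.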

W and b are updated using the derivatives dW and db. If you look at the curve given in the course, you see that it is almost horizontal for very large or very small values of z. This means that the derivatives at these values are almost zero. If you update W and b with such tiny values of dW and db, the new values will be almost identical to the old W and b. Your algorithm will therefore only move very slowly.
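As a minimal illustration (the learning rate and gradient values below are made up, not from the course), here is the standard update rule w := w - alpha * dw applied with a "healthy" gradient and with a near-zero gradient from a flat region of the activation:

```python
alpha = 0.1          # learning rate (illustrative value)
w = 3.0              # current parameter value

dw_steep = 0.5       # gradient from a steep region of the curve
dw_flat = 1e-4       # gradient from a nearly horizontal region

w_after_steep = w - alpha * dw_steep   # steps by 0.05
w_after_flat = w - alpha * dw_flat     # steps by only 0.00001

print(w_after_steep)   # noticeably different from w
print(w_after_flat)    # almost unchanged from w
```

With the flat-region gradient, each iteration barely changes w, so many more iterations are needed to make the same progress. This is the sense in which a near-zero slope "slows down" gradient descent.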

If you look at the curve, you will also see that it is not convex. Here I think you have to distinguish between two different curves discussed in the course: the activation function (sigmoid or tanh), whose near-zero slope at large |z| shrinks the gradients, and the cost function, whose minimum gradient descent is actually trying to reach. A slope of zero at the minimum of the cost function means convergence; a near-zero slope of the activation function just means tiny gradients and slow progress.