Week 2 Derivatives: Logistic Regression as a neural network

Amarsingh_Thakur · September 28, 2021, 5:48pm

In the video, Andrew defined a function f(a). Isn’t derivative a slope of the tangent at a given point on the curve. So by that logic, isn’t the derivative of f(a) on that graph would be 0 and we cannot draw tangent and hence cannot find the derivative of the function? Any thoughts on this, what am I missing?

jonaslalin · September 28, 2021, 6:11pm

The derivative of the sigmoid activation function is not zero for small input values. However, as you see, for values far from zero, the gradient approaches zero. With too small gradients, learning stops, which is one reason why the ReLU activation function is so popular.

Amarsingh_Thakur · September 29, 2021, 10:39am

Hey, I’m not talking about the derivative of the sigmoid function. In the week 2 video name Derivatives, Andrew gave an example as f(a) = 3a and he computed the derivative of this function.

I cleared my doubts though. The derivative of such a function turns out to be constant. so for any change in value ‘a’, the slope would always be constant.

Thanks for the reply.

Topic		Replies	Views
Week 2 on logistic regression gradient descent Neural Networks and Deep Learning coursera-platform	7	644	January 27, 2022
Sigmoid Function Intuition AI Discussions	2	62	September 16, 2023
Week 4, Last assignment / General question Neural Networks and Deep Learning coursera-platform	2	538	December 5, 2021
Why does the activation function's slope matters instead of its log? [Week 3, Activation Function's Video at 4:20] Neural Networks and Deep Learning coursera-platform	2	542	September 1, 2021
C1_W3_General Question (Math, Calculus Fundamentals) Neural Networks and Deep Learning coursera-platform	3	519	October 30, 2022

Week 2 Derivatives: Logistic Regression as a neural network

Related topics