Week 2, Exercise 4, Optional Assignment (Sigmoid derivative)

krishaned · August 31, 2021, 11:12pm

Hi, the exercise is about calculating the sigmoid derivative. I have implemented the function as:

def sigmoid_derivative(x):
s = 1/(np.exp(-x))

ds = s*(1-s)

return ds

I think am making the mistake while calculating “ds”. I am getting 2 tests passed and 1 test failed message when I run the code.

Any suggestions what could be the issue here?

Thanks

paulinpaloalto · September 1, 2021, 12:33am

Your implementation of the derivative is correct, but the s value that you feed into it is wrong. Why reimplement sigmoid manually here? You already built the function to compute that earlier, so why not save yourself some work by just calling it? But if you insist on building it again, at least compare your current implementation to the previous one. I hope the previous one is different.

Your implementation of s is:

s = \displaystyle \frac {1}{e^{-x}}

But note that is equivalent to:

s = e^x

right? That’s a little different than sigmoid, no?

thearkamitra · September 1, 2021, 7:28am

Also,

One quick way to debug these is to find the values of the different variables for different values. We know that the value at infinity (or a very high value) is 1 and the derivative at infinity is 0.

By checking the function, you can see that these are not followed.

krishaned · September 1, 2021, 8:44am

definitely calling the sigmoid function (already implemented) is the better approach. Thanks for the suggestion.

Bio_J · April 13, 2024, 5:30am

Hi,

In this exercise4, I am confused of what we are calculating, why we are calculating the gradient of sigmoid? and why the derivative is equal to

is this dg(x)/dx? Why not calculating dj/dw? Or dj/db? Since the dg(x)/dx is not useful

saifkhanengr · April 13, 2024, 1:04pm

I guess this YouTube playlist from @eddy will answer all your questions.

paulinpaloalto · April 13, 2024, 3:15pm

Sure, it is our goal to compute \frac {\partial J}{\partial w} and \frac {\partial J}{\partial b}, so the question is how you do that. It involves using the Chain Rule and if you work that out (as Eddy shows on the link that Saif gave us) it involves the derivative of sigmoid as one of the factors.

Topic		Replies	Views
Week 2 Practice Coding Assignment Exercise 4 Neural Networks and Deep Learning coursera-platform	4	553	October 12, 2022
Don't get which line of code compute Sigmoid's gradient Supervised ML: Regression and Classification week-module-3	3	500	September 28, 2022
How is the derivative s*(1-s) in sigmoid_derivative Neural Networks and Deep Learning coursera-platform	2	602	February 22, 2022
Python Basics with Numpy: Exercise 4: Sigmoid Derivative Neural Networks and Deep Learning coursera-platform	8	967	August 7, 2024
Week2- sigmoid derivative TypeError: 'numpy.ndarray' object is not callable Neural Networks and Deep Learning coursera-platform	2	605	March 14, 2022

Week 2, Exercise 4, Optional Assignment (Sigmoid derivative)

Related topics