Confusion about Calculating dZ^[l]

paulinpaloalto · October 26, 2022, 4:19am

Well, it may not be intuitive, but you just have to work out the math, remembering that we’re dealing with the output layer here and the activation function is sigmoid.

Prof Ng shows in the lectures and it is given in the notebook that:

dA^{[L]} = - \left (\displaystyle \frac {Y}{A^{[L]}} - \frac {(1 - Y)}{(1 - A^{[L]})} \right )

Now substitute that in your second formula and remember that because of the aforementioned sigmoid, we have:

g^{[L]'}(Z^{[L]}) = A^{[L]} (1 - A^{[L]})

So you can start from the fully general formula that we use in the hidden layers (as Phuc has shown) or you can use the special simplifications that you get because of the specifics of the output layer.

Topic		Replies	Views
I don't know the difference between dZL = AL - Y and dZL = dAL .* g'(ZL) Neural Networks and Deep Learning	2	787	February 8, 2022
Sigmoid Function in Layer L Neural Networks and Deep Learning	8	721	January 30, 2023
Week 3: Why dZ^[1] = W^[2]T dZ^[2] * g^[1]'(Z^[1]) Neural Networks and Deep Learning	3	903	February 13, 2023
The intuition of db^[l]=dz^[l] and da^[l-1]=w^[l-1].dz^[l] Neural Networks and Deep Learning	4	785	May 27, 2023
Assignment Building NN C1 Week 4 Neural Networks and Deep Learning	11	621	August 16, 2022

Confusion about Calculating dZ^[l]

Related topics