I am at the last optional lecture
Someone please explain: how did we get to this result? We know the expected output of only the final layer, so how do we propagate that error back into the hidden layer?
That is just an application of the Chain Rule between layer 1 and layer 2. Prof Ng has specifically designed this course not to require knowledge of calculus, so he just presents the formulas and does not show how to derive them. Here’s a thread with links to the derivations.
@paulinpaloalto thank you so much, I was stuck there. I hadn't tried deriving it with the chain rule. I thought you would somehow need to know the error in the preceding layers, but you only know the desired output of the final layer.
This is what I did, with the help of that thread.
The point is that the dZ^{[l]} formula is just between two adjacent layers, so it ends up being one factor in the overall calculation of the gradients we actually care about, which are dW^{[l]} and db^{[l]}. Those are gradients w.r.t. the final cost J, and thus involve multiplying together the chain rule factors at every layer.
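For concreteness, here is a minimal numpy sketch of that backward pass for a 2-layer network, assuming a tanh hidden layer and a sigmoid output with cross-entropy cost (so dZ^{[2]} = A^{[2]} - Y and g^{[1]'}(Z^{[1]}) = 1 - (A^{[1]})^2). The function name and argument names are my own, not course code:

```python
import numpy as np

def backward_two_layer(X, Y, A1, A2, W2):
    """Backward pass for a 2-layer net: tanh hidden layer, sigmoid output.

    dZ2 is the error at the output layer; dZ1 is obtained from it by the
    chain rule, and dW/db at each layer are the gradients w.r.t. the cost J.
    Columns of X, Y, A1, A2 are the m training examples.
    """
    m = Y.shape[1]

    dZ2 = A2 - Y                                       # sigmoid + cross-entropy
    dW2 = (1 / m) * np.dot(dZ2, A1.T)                  # gradient of J w.r.t. W2
    db2 = (1 / m) * np.sum(dZ2, axis=1, keepdims=True)

    # chain rule: push dZ2 back through W2, then through the tanh activation
    dZ1 = np.dot(W2.T, dZ2) * (1 - np.power(A1, 2))
    dW1 = (1 / m) * np.dot(dZ1, X.T)                   # gradient of J w.r.t. W1
    db1 = (1 / m) * np.sum(dZ1, axis=1, keepdims=True)

    return dW1, db1, dW2, db2
```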
Hey @paulinpaloalto, could you explain how dz^{[2]} w^{[2]} * g^{[1]'}(z^{[1]}) = dz^{[1]} gets us to dz^{[1]} = (w^{[2]})^T dz^{[2]} * g^{[1]'}(z^{[1]})? Why the transpose?
Did you solve this problem?
The simple answer is that the dimensions don't work if you don't include the transpose. If you want to understand why it comes out that way and learn more about the math behind it, please have a look through the links on the derivations thread, which was also given earlier in this discussion.
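As a quick dimension check, here is a toy numpy example with made-up sizes (not course code): with n^{[1]} = 4 hidden units, n^{[2]} = 1 output unit, and m = 5 examples, W^{[2]} is (1, 4) and dZ^{[2]} is (1, 5), so only (W^{[2]})^T dZ^{[2]} produces the (4, 5) shape that dZ^{[1]} must have to match Z^{[1]}:

```python
import numpy as np

n1, n2, m = 4, 1, 5                  # hidden units, output units, examples (made-up sizes)
W2 = np.random.randn(n2, n1)         # shape (1, 4)
dZ2 = np.random.randn(n2, m)         # shape (1, 5)
Z1 = np.random.randn(n1, m)          # shape (4, 5)

dZ1 = np.dot(W2.T, dZ2) * (1 - np.tanh(Z1) ** 2)   # (4,1)·(1,5) -> (4,5), matches Z1
print(dZ1.shape)                     # (4, 5)

# np.dot(W2, dZ2) would raise a ValueError: shapes (1,4) and (1,5) are not aligned
```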