Gradient Descent Backpropagation Calculation

Does anyone have a link or note on how gradient values such as dZ1 are calculated for the vectorized implementation?

I may be answering a different question than you are asking, but Prof Ng did give the formulas in the lectures for all the elements of the gradient calculations. Here’s the expression he gives for dZ^{[1]} in the Week 3 lectures for a specific 2-layer network:

dZ^{[1]} = ( W^{[2]T} \cdot dZ^{[2]} ) * g^{[1]'}(Z^{[1]})

where \cdot is matrix multiplication, * is elementwise multiplication, and g^{[1]} is the activation function for layer 1, so you need the derivative of that function.
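For concreteness, here is a minimal numpy sketch of that one step, assuming tanh as the layer-1 activation (so g^{[1]'}(Z^{[1]}) = 1 - A^{[1]2} when A^{[1]} = tanh(Z^{[1]})). The function name and the shapes are illustrative, not taken from the course notebooks:

```python
import numpy as np

def backprop_dZ1(W2, dZ2, A1):
    # Matrix product W2.T @ dZ2 propagates the error back through layer 2's
    # weights; the elementwise product then applies the local activation
    # gradient. With tanh, g'(Z1) = 1 - tanh(Z1)**2 = 1 - A1**2.
    return np.dot(W2.T, dZ2) * (1 - A1 ** 2)

# Illustrative shapes: n1 = 4 hidden units, n2 = 1 output unit, m = 3 examples.
rng = np.random.default_rng(0)
W2 = rng.standard_normal((1, 4))          # (n2, n1)
dZ2 = rng.standard_normal((1, 3))         # (n2, m)
A1 = np.tanh(rng.standard_normal((4, 3))) # (n1, m)

dZ1 = backprop_dZ1(W2, dZ2, A1)
print(dZ1.shape)  # (4, 3), matching Z1
```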

If the question is why that is the formula, the derivation is beyond the scope of this course. Here’s a thread with links to the derivation of backpropagation in general and references to the matrix calculus needed for the derivation.
