It’s a good question that has come up before. Here’s an earlier thread that discusses the same points.
The point is that most of the formulas Prof Ng shows are for “layer” level Chain Rule factors, and the \frac {1}{m} only comes in when you finally put all the Chain Rule factors together to compute the actual gradients of the weight and bias values. You could have structured things differently, but you need to make sure you don’t end up with multiple factors of \frac {1}{m}.
Of course computing that last factor \displaystyle \frac {\partial J}{\partial L} is easy: since J = \displaystyle \frac {1}{m} \sum_{i=1}^{m} L^{(i)}, we have \displaystyle \frac {\partial J}{\partial L^{(i)}} = \frac {1}{m}. In other words, the gradient of the average is the average of the gradients. Think about it for a second and that should make sense.
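If it helps to see it concretely, here’s a minimal NumPy sketch (all names are just for illustration, not from the course notebooks) that checks the claim numerically: it takes the per-example cross entropy losses L^{(i)}, averages them to get J, and verifies by finite differences that \frac {\partial J}{\partial a_i} is exactly \frac {1}{m} times the per-example gradient \frac {\partial L^{(i)}}{\partial a_i}.

```python
import numpy as np

rng = np.random.default_rng(0)
m = 5
a = 0.1 + 0.8 * rng.random(m)   # activations kept safely inside (0, 1)
y = rng.integers(0, 2, m)       # binary labels

def loss(a, y):
    # Per-example cross entropy L_i (an array of m values)
    return -(y * np.log(a) + (1 - y) * np.log(1 - a))

def dloss_da(a, y):
    # Analytic per-example gradient dL_i/da_i
    return -(y / a) + (1 - y) / (1 - a)

# J = (1/m) * sum_i L_i, so dJ/da_i should equal (1/m) * dL_i/da_i
analytic = dloss_da(a, y) / m

# Numeric check: central finite difference of J w.r.t. each a_i
eps = 1e-6
numeric = np.empty(m)
for i in range(m):
    a_plus, a_minus = a.copy(), a.copy()
    a_plus[i] += eps
    a_minus[i] -= eps
    numeric[i] = (loss(a_plus, y).mean() - loss(a_minus, y).mean()) / (2 * eps)

print(np.allclose(analytic, numeric, atol=1e-6))  # True
```

The single \frac {1}{m} shows up exactly once, in the `.mean()` that forms J; every other Chain Rule factor in the check is a plain per-example derivative.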