Week4 - Why explicitly calculate dA? Not required in Week2 and 3

G11 · October 13, 2021, 3:21pm

I wonder why we suddenly bother to introduce dA in week 4? I understand that it is a part of the calculus ( dA[l]=W.T[l+1]*dZ[l+1] and dZ[l]=dA[l]*g’(z[l]) ), but why not stick to the known quantities W and dZ? We use dA[l] to derive dZ[l], but to initialize the back prop we can use dZ[L]=A[L]-Y, so we don’t really have to explicitly calculate the dA at all. Why introduce another quantity now?

Thanks.

paulinpaloalto · October 14, 2021, 3:20am

Because Week 4 is the point at which we finally reach the fully general case. You can make some shortcuts with the 1 or 2 layer cases, but that no longer works so well in the general case. You need to compute dA^{[l]} at each layer, not just the output layer.

Of course there is a certain amount of discretion here as well. You could probably formulate this in different ways, but Prof Ng is teaching the class: he has chosen to formulate it in the way that he thinks makes the most sense. When you’re teaching the class, you will get to choose the formulation.

Topic		Replies	Views
Assignment Building NN C1 Week 4 Neural Networks and Deep Learning coursera-platform	11	648	August 16, 2022
Backpropagation week 3 vs week 4 Neural Networks and Deep Learning coursera-platform	4	585	August 5, 2022
W4 _L_model_backward_Why do we need dA0 Neural Networks and Deep Learning coursera-platform	2	523	December 27, 2022
Week 4's Quiz. Is the grader right? Neural Networks and Deep Learning week-module-4 , coursera-platform	4	41	February 13, 2025
W4_A1_Video Lecture on Forward & Backward functions Neural Networks and Deep Learning coursera-platform	4	563	January 15, 2023

Week4 - Why explicitly calculate dA? Not required in Week2 and 3

Related topics