W2_Video Lecture_Vectorizing logistic regression_Calculation of dw

Nishant_Mahajan · August 7, 2023, 12:36pm

I am trying to wrap my head around dw calculation. Matrix multiplication is row x column so the first element of dw should be 1 feature (all examples) [row] * dz (which is element wise multiplication)[column]. Why is professor saying X(1)dz(1). Doesn’t X(1) means first training example?

rmwkwok · August 7, 2023, 2:19pm

Hello @Nishant_Mahajan,

Both you and the lecture are correct.

When we multiply X with dz, if you agreed that dz1 will multiply with the first feature of the first sample, then when you shift your focus to the second row of X, I believe you would also agree that dz1 will also multiply with the second feature of the first sample. If we repeat this over all of the rows, then every feature of the first sample will multiply with dz1, and therefore dz1 will multiply with x1.

Another way to see what I have said is by making up a small w and a small dz on a piece of paper, do the matrix multiplication like you said, and finally highlight all of the terms about the first sample, and you will see that they multiply with and only with dz1.

Cheers,
Raymond

Topic		Replies	Views
Week 2 - Vectorizing Log Reg Grad Out - dw computation Neural Networks and Deep Learning	2	533	December 2, 2021
Course 1 Week 2 Neural Networks and Deep Learning	2	594	June 7, 2021
The dimensions of dW Neural Networks and Deep Learning week-3	4	35	February 6, 2025
Week 2: w1 and w2 as inputs for logistic regression - Gradient Descent Neural Networks and Deep Learning	3	450	October 6, 2023
C1_W2: Logistic Regression on m examples. (Error?) Neural Networks and Deep Learning	5	513	March 7, 2023

W2_Video Lecture_Vectorizing logistic regression_Calculation of dw

Related topics