A doubt on Week 3 Lecture

JJaassoonn · August 2, 2023, 12:03pm

Dear Administrator,

Could you please guide me on this issue?

“Note that for any variable, foo and dfoo always have the same dimensions. That’s why, w and dw always have the same dimension. Similarly, for b and db, and z and dz, and so on.”

May i know the reason of this statement is valid?

Thank you

saifkhanengr · August 2, 2023, 12:06pm

In the assignment, you can confirm this statement by printing the shape of W and dW, b and db, Z and dZ, A and dA (of the same layer). You will see every pair has the same shape but the one contain values and another one it’s derivatives.

JJaassoonn · August 2, 2023, 12:36pm

Dear Mr Saif,

According to the lecture,

The dimension of Z[1] is (n[1], 1)
There is no any explanation on why dZ[1] has a dimension (n[1],1) as same as Z[1], except just only this statement

“Note that for any variable, foo and dfoo always have the same dimensions. That’s why, w and dw always have the same dimension. Similarly, for b and db, and z and dz, and so on.”

Based on the ambiguous assumption, dZ[1] is defined to have a dimension of (n[1],1). So we have to arrange a element-wise multiplication between (W[2].T) (dZ[2]) and g[1]'(Z[1]), just to fulfill the requirement of having (n[1],1) dimension.
(W[2].T) (dZ[2]) is defined to calculate using dot product. In the same way, the element-wise multiplication between (W[2].T) (dZ[2]) and g[1]'(Z[1]) can be changed to using dot product as well.

May i have any idea to understand the abovementioned statement?

Thank you

JJaassoonn · August 2, 2023, 1:17pm

Yes, i noticed that this is true when i tried it in the assignment. But i find that it is quite coincident so that i am looking for an idea of understanding the theory behind it.

saifkhanengr · August 2, 2023, 1:36pm

The dimension of Z depends on the dimension of W and X, right? And the dimension of X depends on the number of features and number of examples. Moreover, the dimension of W depends on the number of neurons and number of features. Prof. Andrew explain the dimensions deeply in one of his video.

JJaassoonn · August 2, 2023, 2:03pm

Yes, every term is clear except this dZ[1] = (W[2].T) (dZ[2]) * g[1]'(Z[1]).

According to the lecture, there is an assumption mentioning that dZ[1] has a same dimension as Z[1], So we have to arrange a element-wise multiplication (instead of dot product) between (W[2].T) (dZ[2]) and g[1]'(Z[1]), just to fulfill the requirement of having (n[1],1) dimension.

The dimension of dZ[1] depends on the dimension of W[2].T , dZ[2] ,and g[1]'(Z[1]).

Element-wise Multiplication:
dZ[1] = (W[2].T) (dZ[2]) * g[1]'(Z[1])

Dot Product
dZ[1] = (W[2].T) (dZ[2]) . g[1]'(Z[1])

Two different calculations produce different dimensions of result. How should we ensure that dimension of dZ[1] = dimension of Z[1]?

saifkhanengr · August 2, 2023, 2:14pm

This is the derivative calculation. To understand this equation, you have to be familiar with Calculus. There are many posts related to this topic. You can read this and this and yet this one.

Topic		Replies	Views
Question regarding dimensions of w in logistic regression Neural Networks and Deep Learning coursera-platform	3	344	October 13, 2023
Wrong values for dW1 Neural Networks and Deep Learning coursera-platform	9	567	January 16, 2022
Course 1 Week 3 Backpropagation Intuition (Optional) Neural Networks and Deep Learning coursera-platform	5	811	December 18, 2021
The dimensions of dW Neural Networks and Deep Learning week-3 , coursera-platform	4	35	February 6, 2025
Is W[1] the transposed version on W Neural Networks and Deep Learning coursera-platform	1	505	June 2, 2022

A doubt on Week 3 Lecture

Related topics