Week 3 (W3) - W matrix values

nicopz · May 16, 2025, 4:01pm

Hi everyone!
I have a question on the values inside the matrix W. From what I understood from the video (link in the bottom) those values are different in each row. What I don’t get is that shouldn’t those values be the same for each training example? If not what you are doing with the vectorization is changing the neuron shape. Before vectorization A[0] had a length of nx (nx being the length of each vector of each training example). Now it appears that A[0] has a length of nx times m (m bieng the number of training examples) to respect the original length the weights of matrix W should be the same in each row right? I’m confused about this.
Thanks in advance.

Nico

paulinpaloalto · May 16, 2025, 5:03pm

Yes, the rows of the W matrix represent the coefficients for a given output neuron at that layer. So they will be different per row. But they are the same for every input sample, right? Think about what happens at the first layer when you compute:

Z^{[1]} = W^{[1]} \cdot X + b^{[1]}

The matrix W^{[1]} has shape n^{[1]} x n_x, where n^{[1]} is the number of output neurons for layer 1 and n_x is the number of input features in each sample.

Then X has dimensions n_x x m where m is the number of input samples in the current batch of inputs.

So think about what will happen in that dot product W^{[1]} \cdot X: each row gets “dotted” with each sample (each column of X). The result will be that Z^{[1]} has shape n^{[1]} x m.

paulinpaloalto · May 16, 2025, 5:22pm

Here’s a thread from a while ago that also talks in some detail about this material in Week 3 and how the W matrices are organized and how they work.

nicopz · May 16, 2025, 5:54pm

Thank you so much! I got confused with the image. the one from the class and the thread you posted. In there I saw X1 X2 X3 with different wights so I thought they were different weight for different training example, but they are the same training example, just the components of one training example right?

TMosh · May 16, 2025, 6:34pm

Weights apply to each feature, not each example.

paulinpaloalto · May 16, 2025, 6:59pm

Yes, in the screenshot on that thread I linked, the values x_1, x_2 and x_3 are the features (components) of one sample value x. That would be one column of the matrix X in my notation in my post on this thread.

Topic		Replies	Views
Question regarding week 3 video 3 "computing a nn's output" Neural Networks and Deep Learning week-3 , coursera-platform	2	17	October 21, 2024
Week 3, Video 3: Understanding matrix size Neural Networks and Deep Learning coursera-platform	7	658	January 29, 2025
Week 3 Quiz - Potential Contradiction Neural Networks and Deep Learning week-3 , coursera-platform	3	249	May 6, 2024
[Course 1 Week 3] Quiz question Neural Networks and Deep Learning coursera-platform	3	649	June 14, 2022
Problem understanding the shape of w Neural Networks and Deep Learning week-2 , week-3 , coursera-platform	4	34	April 10, 2025

Week 3 (W3) - W matrix values

Related topics