Problem understanding the shape of w

In the second week, we learn that in simple logistic regression, the weight matrix w has shape (n_x, 1). This means w has n_x rows and 1 column, so each feature weight is stored in its own row.
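For concreteness, here is a minimal NumPy sketch of the Week 2 shapes (the sizes and variable names are made up for illustration, with m examples stacked as columns of X):

```python
import numpy as np

n_x, m = 3, 5                      # 3 features, 5 training examples (made-up sizes)
X = np.random.randn(n_x, m)        # inputs stacked as columns, shape (n_x, m)
w = np.zeros((n_x, 1))             # Week 2 convention: one weight per feature, in its own row
b = 0.0

Z = np.dot(w.T, X) + b             # w is transposed inside the formula: z = w^T x + b
print(Z.shape)                     # (1, m): one prediction per example
```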

However, in the third week, second lecture, when we move on to hidden layers, we learn that the new weight matrix w has shape (n, n_x), where n is the number of neurons in that layer and n_x is the number of “features” (i.e., the number of neurons in the previous layer).
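Again just as a sketch (sizes made up), the Week 3 convention for a layer of n neurons fed by n_x inputs looks like this:

```python
import numpy as np

n_x, n, m = 3, 4, 5                # previous-layer size, layer size, batch size (made-up)
A_prev = np.random.randn(n_x, m)   # activations from the previous layer (or X itself)
W = np.random.randn(n, n_x) * 0.01 # Week 3 convention: (units in this layer, units feeding it)
b = np.zeros((n, 1))

Z = np.dot(W, A_prev) + b          # no transpose needed: Z = W A_prev + b
print(Z.shape)                     # (n, m): one row per neuron in this layer
```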

Towards the end of the video, the professor draws a network with 3 inputs, one hidden layer of 4 neurons, and one output neuron. If we focus only on the hidden layer and the output layer, it looks like logistic regression, with the hidden layer activations serving as inputs. From the second week’s perspective, I would expect the weight matrix to be (n_x, 1), which in this case would be (4, 1). However, the professor says it’s (1, 4). Why does this difference occur? It’s confusing me a lot.

Thanks!

It is a very loose standard, but generally a weight matrix's shape is outputs x inputs: the number of neurons in the current layer by the number of inputs feeding it. In Week 2, w is defined as (n_x, 1) and the transpose happens inside the formula, z = w^T x + b; in Week 3, W is defined with that transpose already built in, so Z = WX + b with no transpose. That is why the output layer's matrix in your example is (1, 4) rather than (4, 1), as the sketch below shows.
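Here is a hedged sketch of that rule applied to the 3 -> 4 -> 1 network from the question (the batch size and initialization are arbitrary, just to show the shapes):

```python
import numpy as np

# 3 inputs -> hidden layer of 4 units -> 1 output unit (the network from the question)
W1 = np.random.randn(4, 3) * 0.01  # hidden layer: 4 outputs x 3 inputs
b1 = np.zeros((4, 1))
W2 = np.random.randn(1, 4) * 0.01  # output layer: 1 output x 4 inputs -> (1, 4), not (4, 1)
b2 = np.zeros((1, 1))

X = np.random.randn(3, 7)          # 7 examples as columns (made-up batch size)
A1 = np.tanh(np.dot(W1, X) + b1)                # shape (4, 7)
A2 = 1 / (1 + np.exp(-(np.dot(W2, A1) + b2)))   # shape (1, 7), sigmoid output
print(W1.shape, W2.shape, A2.shape)
```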


Here’s a historical thread with some explanation of what is happening when Prof Ng defines the weight matrices in DLS C1 Week 3.


Well, here is the course convention for “deep neural network classification” (as opposed to “single-layer network doing logistic regression”). I’m continually updating this diagram with observations gleaned from later parts of the course. Hopefully it’s informative.
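In code form, the general convention the diagram summarizes is that W[l] has shape (n[l], n[l-1]) and b[l] has shape (n[l], 1). A minimal sketch, assuming the layer sizes are given as a list (the helper name and initialization scale are my own, for illustration):

```python
import numpy as np

def init_params(layer_dims, seed=1):
    """Parameter shapes under the course convention:
    W[l] is (n[l], n[l-1]) and b[l] is (n[l], 1)."""
    rng = np.random.default_rng(seed)
    params = {}
    for l in range(1, len(layer_dims)):
        params[f"W{l}"] = rng.standard_normal((layer_dims[l], layer_dims[l - 1])) * 0.01
        params[f"b{l}"] = np.zeros((layer_dims[l], 1))
    return params

# the 3 -> 4 -> 1 network discussed earlier in the thread
params = init_params([3, 4, 1])
print(params["W1"].shape, params["W2"].shape)  # (4, 3) (1, 4)
```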