In this graph, the computation is W times X: the input feature X is placed on the right side of W. But in the same video, another example places the input feature on the left side of W (second image). Why the difference? Which order is correct?
You have to perform a dot product of the weight matrix (the model weights) and the input features. Wx + b is the common notation for a single input x (lower case); X (upper case) represents a batch of inputs.
In the second picture, notice how each input is a row and the model weights are arranged as a column. So a batch of input data has shape (batch_size, num_features), and the model weights have shape (num_features, 1). The matrix multiplication of X and W then has shape (batch_size, 1), as shown on the right side.
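A minimal NumPy sketch of those shapes (the sizes 4 and 3 are just illustrative assumptions):

```python
import numpy as np

batch_size, num_features = 4, 3
X = np.random.rand(batch_size, num_features)  # each row is one input
W = np.random.rand(num_features, 1)           # model weights as a column
b = 0.5

out = X @ W + b   # XW + b for the whole batch
print(out.shape)  # (4, 1): one output per input in the batch
```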
You can perform the multiplication in either order, as long as the operation computes the intended dot product and the results are interpreted correctly.
So does this mean both WX + b and XW + b are correct, depending on how the rows and columns are arranged in W and X?
That is correct. Libraries like TensorFlow use an input shape of (batch_size, num_features).
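To see that the two orders produce the same numbers for a single input, here is a quick NumPy check (the feature count is an arbitrary assumption; note that transposing both operands swaps the order of the product):

```python
import numpy as np

num_features = 3
x = np.random.rand(num_features, 1)      # single input as a column vector
W = np.random.rand(1, num_features)      # weights as a row, for W x
b = 0.5

wx = W @ x + b        # W x + b, shape (1, 1)
xw = x.T @ W.T + b    # x^T W^T + b: same product with operands transposed

print(np.allclose(wx, xw))  # True: both orders give the same result
```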