Why is the weight matrix transposed in logistic regression but not in the neural network?

The Z calculation for a neural network is:

Z^[1] = W^[1] X + b^[1]

Please note that here W is NOT transposed.

However, for logistic regression it is: z = w^T x + b.

What’s the reason for flipping the weight matrix?


We apply the dot product between W and X to form w1*x1 + w2*x2 + ... + wn*xn for each example x. The dot product requires that the number of columns in the first matrix match the number of rows in the second: A(p, q) . B(q, r) yields a (p, r) result.

In the logistic regression initialization, w and X have the same orientation, since there is a one-to-one relation between weights and features. Therefore, we apply the transpose operation to make them dot-product compatible, so that the dot product yields a single value for each example x. This value for each example is its predicted y value.
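A minimal numpy sketch of the logistic regression case, assuming hypothetical sizes of n_x = 4 features and m = 5 examples, with the course convention that X has shape (features, examples):

```python
import numpy as np

n_x, m = 4, 5
w = np.random.randn(n_x, 1)   # one weight per feature, same orientation as a column of X
b = 0.0
X = np.random.randn(n_x, m)   # each column is one example

# w has shape (n_x, 1) and X has shape (n_x, m): not dot-product compatible as-is.
# Transposing w gives (1, n_x) . (n_x, m) -> (1, m): one z value per example.
Z = np.dot(w.T, X) + b
print(Z.shape)  # (1, 5)
```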

In a neural network, the matrix orientations are different. The first layer's W has dimensions (number of units in the first layer) × (number of input features). That means each row corresponds to one unit's activation and holds a weight for each input feature, a many-to-many relation. X has dimensions (number of features) × (number of examples). Since the matrices are already dot-product compatible, there is no need to transpose either one. Similarly, in the second, third, and deeper layers, W has dimensions (number of units in the layer) × (number of units in the previous layer), which makes it compatible with the previous layer's A matrix for the dot product. The final layer's W has dimensions (number of output classes) × (number of units in the previous layer), which maps the computed values to the output classes.
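A minimal numpy sketch of those shapes, assuming a hypothetical 2-layer network with n_x = 4 input features, n_1 = 3 hidden units, n_2 = 1 output unit, and m = 5 examples:

```python
import numpy as np

n_x, n_1, n_2, m = 4, 3, 1, 5
X  = np.random.randn(n_x, m)     # (features, examples)
W1 = np.random.randn(n_1, n_x)   # (units in layer 1, input features)
b1 = np.zeros((n_1, 1))
W2 = np.random.randn(n_2, n_1)   # (units in layer 2, units in layer 1)
b2 = np.zeros((n_2, 1))

# W is already oriented as (units, previous units), so no transpose is needed.
Z1 = np.dot(W1, X) + b1          # (n_1, n_x) . (n_x, m) -> (n_1, m)
A1 = np.tanh(Z1)
Z2 = np.dot(W2, A1) + b2         # (n_2, n_1) . (n_1, m) -> (n_2, m)
print(Z1.shape, Z2.shape)        # (3, 5) (1, 5)
```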


Thanks for your answer. My question is: why are the orientations different? It seems like a strange convention.