Confusion regarding input matrix shape

Is it true that in ML frameworks like PyTorch and TensorFlow the shape of the input matrix is (number of examples x number of features), whereas in Andrew Ng’s implementation it is (features x examples)?

Frameworks typically follow (num examples, features per example). Please see this link to learn about the notation used in the courses.
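For concreteness, here is a minimal PyTorch sketch of the “examples first” convention; the sizes (32 examples, 10 features, 4 output units) are made up for illustration.

```python
import torch
import torch.nn as nn

# Hypothetical sizes for illustration: 32 examples, 10 features each.
m, n_features = 32, 10
X = torch.randn(m, n_features)   # framework convention: (num examples, num features)

layer = nn.Linear(in_features=n_features, out_features=4)
out = layer(X)                   # batch dimension stays first
print(out.shape)                 # torch.Size([32, 4])
```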


Yes, but why the distinction? Is there a reason why Andrew Ng’s implementation is different?


I don’t know. Adding @paulinpaloalto and @TMosh, who might know.


It is an arbitrary choice. In DLS, Professor Ng uses the features x samples representation for Courses 1 and 2, where he is dealing with samples that are vectors. But once he gets to Course 4 (ConvNets), where the inputs are images of shape height x width x channels, he switches to the “samples first” orientation for the data, because that is how everyone does it when the input batches are 4-dimensional tensors. You then still have the choice of whether the channel dimension comes before or after the height and width dimensions: in TensorFlow it is m x h x w x c, while in PyTorch it is usually m x c x h x w. But even in PyTorch you have a choice, if memory serves.
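As a rough illustration of the two image layouts (the batch size and image dimensions below are made up), converting a channels-last tensor into the channels-first shape that PyTorch convolutions expect looks something like this:

```python
import torch

# Hypothetical batch: 8 RGB images of size 64 x 64.
m, h, w, c = 8, 64, 64, 3

# TensorFlow's default layout is "channels last": (m, h, w, c).
x_tf_style = torch.randn(m, h, w, c)

# PyTorch convolutions expect "channels first": (m, c, h, w).
x_torch_style = x_tf_style.permute(0, 3, 1, 2)
print(x_torch_style.shape)  # torch.Size([8, 3, 64, 64])

# PyTorch also lets you keep a channels-last memory layout while the
# logical shape stays (m, c, h, w).
x_cl = x_torch_style.contiguous(memory_format=torch.channels_last)
```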

Of course, the choice of the n_x x m orientation in DLS C1 and C2 has a big effect on how the rest of the formulas are written in terms of the weight matrices and how the dot products are done.
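A small NumPy sketch of that effect, with made-up layer sizes: in the features-by-examples orientation the forward step is Z = W X + b, while in the examples-first orientation the same computation becomes Z = X Wᵀ + bᵀ.

```python
import numpy as np

np.random.seed(0)
n_x, n_h, m = 3, 4, 5            # hypothetical sizes: 3 features, 4 hidden units, 5 examples

# DLS C1/C2 convention: columns are examples.
X_cols = np.random.randn(n_x, m)     # (n_x, m)
W = np.random.randn(n_h, n_x)        # (n_h, n_x)
b = np.random.randn(n_h, 1)          # broadcasts across the m columns
Z_cols = W @ X_cols + b              # (n_h, m)

# "Samples first" convention used by most frameworks: rows are examples.
X_rows = X_cols.T                    # (m, n_x)
Z_rows = X_rows @ W.T + b.T          # (m, n_h)

assert np.allclose(Z_cols, Z_rows.T)  # same numbers, just transposed
```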
