Although I understand that the -1 is related to handling the batch size in fully connected layers, I'm still a bit confused.

We are assuming that the first dimension is the "samples" dimension, right? So aren't they the same thing? That is, the size of the first dimension (index 0 in Python) is the number of samples. And the -1 just means "use whatever size this dimension happens to be," so the code works for any batch size.

You don't need to specify that size explicitly, since it can be inferred: the total number of elements divided by the product of all the other dimensions' sizes gives the size of the -1 dimension. So it's a very nice, general way to write the code.
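Here's a minimal sketch of that inference in action (the shapes are made up for illustration; this assumes PyTorch is installed):

```python
import torch

# Pretend batch of 4 "images", each 3x2x2 = 12 values.
x = torch.randn(4, 3, 2, 2)

# -1 tells view to infer that dimension: 48 total elements / 12 features
# per sample, so the batch dimension is inferred as 4.
flat = x.view(-1, 3 * 2 * 2)
print(flat.shape)  # torch.Size([4, 12])

# The exact same line works unchanged for a different batch size.
y = torch.randn(7, 3, 2, 2)
print(y.view(-1, 3 * 2 * 2).shape)  # torch.Size([7, 12])
```

Note that view only allows one -1, since only one unknown size can be inferred from the total element count.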

We can construct some experiments using torch.Tensor.view and torch.flatten, similar to this post about numpy's np.reshape.
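As a starting point for those experiments, here's a quick sketch showing that torch.flatten with start_dim=1 and the view(batch, -1) idiom produce the same result (tensor values chosen arbitrarily):

```python
import torch

x = torch.arange(24).view(2, 3, 4)

# flatten everything after the batch dimension...
a = torch.flatten(x, start_dim=1)  # shape (2, 12)

# ...which is equivalent to the view idiom with -1.
b = x.view(x.size(0), -1)          # shape (2, 12)
print(torch.equal(a, b))  # True

# A bare -1 collapses the whole tensor into one dimension.
print(x.view(-1).shape)   # torch.Size([24])
```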

I will play around with this, but it may take me a few hours (have some real life to take care of :grinning_face:). Stay tuned!