Confused with some TF concepts

Hi there,

I am slightly confused by some TF concepts, and I am trying to understand dataset shapes.

  1. Why are the datasets this weird generator object rather than a constant of shape (nx, m) ?

  2. When I have a dataset (e.g. x_train):
    x_train.element_spec only shows the shape of one example.
    How can I see how many examples are contained? With NumPy, the number of examples was part of the shape.

  3. In W3 exercise 6 - instructions say: It’s important to note that the “y_pred” and “y_true” inputs of tf.keras.losses.categorical_crossentropy are expected to be of shape (number of examples, num_classes).
    How could I have deduced the expected shapes from the TF documentation?

  4. In W3 section 6 (training the model), why is a transpose of minibatch_X passed, as in forward_propagation(tf.transpose(minibatch_X), parameters)? My understanding is that X_train is of shape (input size = 12288, number of training examples = 1080) and that forward_propagation() takes an input parameter X of shape (input size, number of examples).

Thanks for the help.

What “weird generator object” are you referring to?

A TensorFlow model doesn’t care how many examples there are; that doesn’t impact the model design. You don’t have a size until you fit the model to a specific dataset.

Try x_train.shape perhaps.

The TensorFlow documentation assumes you already are an expert. Sometimes experience is the best teacher. Tip: often you have to look up the properties of the parent object, when one exists.

Sometimes you have to transpose things in TensorFlow simply to avoid getting error messages about shapes. There aren’t a lot of standards about what the default shapes should be.

Other mentors will probably have more technically-oriented explanations. I’m the practical voice.

Hello @Leonard_Bouygues1,

For your questions 2, 3, and 4: they will become clear after some work, and that work is something I can suggest for you.

  1. A generator-type dataset lets you feed data to the training process without having to preload everything into memory, which is obviously necessary when your data is larger than your memory. This is a course, and this is something we need to learn and be able to use.

  2. Check out the cardinality() method in TensorFlow’s Dataset documentation and study how it works. For other approaches, search Stack Overflow, where their pros and cons are also discussed.

  3. Check out the axis parameter and its explanation in the documentation, think about it, and experiment with it.

  4. Take a minibatch_X out, print its shape, and determine why a transpose operation was needed.
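To make point 1 concrete, here is a minimal sketch (assuming TensorFlow 2.x; the generator and sizes are made up for illustration) showing that a generator-backed Dataset produces examples lazily, one at a time, so the full dataset never has to sit in memory at once:

```python
import numpy as np
import tensorflow as tf

def example_gen():
    # In a real pipeline, each example might be read from disk here.
    for i in range(1080):
        yield np.full((12288,), float(i), dtype=np.float32)

ds = tf.data.Dataset.from_generator(
    example_gen,
    output_signature=tf.TensorSpec(shape=(12288,), dtype=tf.float32),
)

# element_spec describes ONE example, not the whole dataset:
print(ds.element_spec)  # TensorSpec(shape=(12288,), dtype=tf.float32, ...)
```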
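For point 2, a quick sketch (TensorFlow 2.3+; the sizes are just the course’s numbers, used for illustration) of how cardinality() answers the “how many examples?” question:

```python
import tensorflow as tf

ds = tf.data.Dataset.from_tensor_slices(tf.zeros((1080, 12288)))

# cardinality() returns the number of examples when TF can infer it:
print(ds.cardinality().numpy())  # 1080

# For generator-backed datasets, TF cannot infer the size and returns
# tf.data.UNKNOWN_CARDINALITY; a fallback is to iterate and count,
# which is exact but touches every example:
n = sum(1 for _ in ds)
```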
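For point 3, a small sketch (TensorFlow 2.x, toy numbers) of the (number of examples, num_classes) convention; by default the class axis is the last one (axis=-1), which is why that shape works:

```python
import tensorflow as tf

y_true = tf.constant([[0., 1., 0.],
                      [1., 0., 0.]])     # shape (2, 3): 2 examples, 3 classes
y_pred = tf.constant([[0.1, 0.8, 0.1],
                      [0.7, 0.2, 0.1]])  # shape (2, 3)

loss = tf.keras.losses.categorical_crossentropy(y_true, y_pred)
print(loss.shape)  # (2,) -- one loss value per example, reduced over axis=-1
```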
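And for point 4, the transpose can be checked with a toy minibatch (the batch size of 32 here is just an illustrative assumption): the Dataset yields minibatches as (examples, features), while the assignment’s forward_propagation expects (input size, number of examples).

```python
import tensorflow as tf

minibatch_X = tf.zeros((32, 12288))  # (batch size, input size) from the Dataset
X = tf.transpose(minibatch_X)        # (input size, batch size) = (12288, 32)
print(minibatch_X.shape, X.shape)
```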

Good luck!

Thank you @rmwkwok and @TMosh

An additional question if you don’t mind:

In Course #1, Prof Ng advises not to use NumPy arrays of shape (n,) and to instead initialize them to shape (n, 1).

In Course #2, W3 exercise, the dataset has an initial shape of (64, 64, 3). We then use tf.reshape(image, [-1,]) to reshape it into (12288,).

  • What does the comma followed by nothing mean?
  • Could we not specify [-1, 1] to get a (12288, 1) shape? It seems to break the model later on.

Thanks for your insights.

Hi @Leonard_Bouygues1 ,

Prof Ng’s advice is prudent: explicitly specify what the array’s shape should be.
When a shape component is -1, it has a special meaning. Here is a quote from the tf.reshape documentation:
“If one component of shape is the special value -1, the size of that dimension is computed so that the total size remains constant. In particular, a shape of [-1] flattens into 1-D. At most one component of shape can be -1.”

That means the result is a 1-D array or tensor: there is literally only one dimension.
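A quick sketch (TensorFlow 2.x) of the difference; note that the trailing comma in [-1,] is plain Python list syntax and changes nothing:

```python
import tensorflow as tf

image = tf.zeros((64, 64, 3))

flat = tf.reshape(image, [-1])     # shape (12288,): 1-D, a single dimension
col  = tf.reshape(image, [-1, 1])  # shape (12288, 1): 2-D column vector
print(flat.shape, col.shape)
```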