C2W3 Lab assignment: data dimensions in model

Ozy_Sjahputera · January 14, 2024, 7:14pm

After reading the function model() in Lab Assignment 3.3, a few questions came up:

Let C = number of features, m = number of samples

In previous sections, we implemented two functions: forward_propagation() and compute_total_loss(). Both expect inputs of shape (C, m). In model(), the inputs X_train and Y_train are also provided in shape (C, m). However, when forward_propagation() and compute_total_loss() are called from model(), transposed versions of the data are passed as arguments. Calling tf.transpose(X_train) and tf.transpose(Y_train) would change their shapes to (m, C), which is not what forward_propagation() and compute_total_loss() expect.
I know that X_train and Y_train are fed to tf.data.Dataset and used to create mini-batches. Does tf.data.Dataset or tf.data.Dataset.batch somehow change the shape of X_train and Y_train?
I also noticed that X_test and Y_test are turned into tf.data.Dataset objects and that mini-batches are created from them. What is the reason for splitting the test set (I assume "test" here means validation) into mini-batches? The test mini-batches are used for predictions every 10 training epochs. What is the disadvantage of treating the test/validation set as one batch?

Thanks

paulinpaloalto · January 14, 2024, 8:01pm

Here’s a thread which explains how the dimension orientation is handled in this assignment.

The tf.data.Dataset class does not change the orientation of the input data: it requires that the input have the samples dimension as the first dimension and then it subdivides along that dimension. The point about TF assuming samples are the first dimension is also covered in the thread I linked above.
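To make the orientation concrete, here is a minimal sketch of the slicing and batching behavior described above. The sizes C = 4 and m = 10 and the batch size of 3 are made-up values for illustration, not the ones used in the assignment, and the array X stands in for the real training data.

```python
import numpy as np
import tensorflow as tf

C, m = 4, 10  # hypothetical sizes: 4 features, 10 samples

# Samples-first array of shape (m, C): one row per sample.
X = np.arange(m * C, dtype=np.float32).reshape(m, C)

# from_tensor_slices slices along the first axis,
# so each dataset element is one sample of shape (C,).
dataset = tf.data.Dataset.from_tensor_slices(X)
element_shape = tuple(next(iter(dataset)).shape)

# batch() stacks consecutive elements along a new leading axis,
# so a mini-batch has shape (batch_size, C).
first_batch = next(iter(dataset.batch(3)))
batch_shape = tuple(first_batch.shape)

# Transposing the mini-batch gives the features-first layout (C, batch_size).
transposed_shape = tuple(tf.transpose(first_batch).shape)

print(element_shape, batch_shape, transposed_shape)
```

Under these assumptions, a mini-batch comes out of the dataset samples-first, and the transpose is what turns it into a features-first (C, batch_size) tensor.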

It is generally a good idea to add the logic to process data in minibatches. The Dataset class in TF is commonly used for that. If you’re going to do that at all, then you do it for all your input datasets, because the rest of the code is then written to consume batches.
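As a rough illustration of why batching the test set need not change the reported number, the sketch below accumulates accuracy over mini-batches with a Keras metric and compares it to a single pass over the whole set. The labels, predictions, and batch size are invented for the example, and the use of tf.keras.metrics.CategoricalAccuracy is an assumption about how accuracy is tracked, not a statement about the assignment's code.

```python
import numpy as np
import tensorflow as tf

# Hypothetical test set: 6 samples, 3 classes, samples-first layout.
true_classes = np.array([0, 1, 2, 0, 1, 2])
pred_classes = np.array([0, 1, 1, 0, 2, 2])  # 4 of 6 are correct
y_true = np.eye(3, dtype=np.float32)[true_classes]  # one-hot labels, shape (6, 3)
y_pred = np.eye(3, dtype=np.float32)[pred_classes]  # stand-in scores, shape (6, 3)

# Accumulate accuracy over mini-batches of 4 (the last batch holds only 2 samples).
batched_metric = tf.keras.metrics.CategoricalAccuracy()
test_batches = tf.data.Dataset.from_tensor_slices((y_pred, y_true)).batch(4)
for batch_pred, batch_true in test_batches:
    batched_metric.update_state(batch_true, batch_pred)
batched_accuracy = float(batched_metric.result())

# Evaluate the whole set in a single call for comparison.
full_metric = tf.keras.metrics.CategoricalAccuracy()
full_metric.update_state(y_true, y_pred)
full_accuracy = float(full_metric.result())

print(batched_accuracy, full_accuracy)
```

Because the metric keeps a running count of correct and total samples, the two values agree even when the last batch is smaller than the others.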
