Question on Train/Test Split and Data Handling in Pre-Trained Models for NMT Assignment

Hi there,

I hope you’re doing well. I am almost finished with the Deep Learning Specialization and am currently working on the Programming Assignment for the Neural Machine Translation module (Sequence Models).

I noticed that from the second week of the Convolutional Neural Networks (CNN) course onward, the data isn’t explicitly split into training, validation, and test sets, and I was wondering why this split isn’t shown in the code. Is it because the pre-trained models (like YOLO, for instance) already had their training and test splits set up before being used? Similarly with RNN models: for example, in the “Neural Machine Translation with Attention” assignment, the model is trained on a human-readable dataset paired with equivalent standardized machine-readable data. The data is preprocessed, but I am curious about where the X and Y splits for training and testing actually occur.

Typically, when building models, the data is split into training and testing sets before being passed to the model. For instance, in object detection tasks with YOLO, the dataset is split into training and validation sets using functions like train_test_split from sklearn.model_selection, or by organizing the data into separate folders (such as train and test). However, I couldn’t find this step in the code provided in the assignment.
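For reference, this is the kind of explicit split I mean. This is just an illustrative sketch with toy arrays, not code from the assignment:

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Toy data standing in for (input, label) pairs; shapes are illustrative only.
X = np.arange(100).reshape(50, 2)   # 50 examples, 2 features each
Y = np.arange(50)                   # 50 labels

# Hold out 20% of the examples for testing; fix random_state for reproducibility.
X_train, X_test, Y_train, Y_test = train_test_split(
    X, Y, test_size=0.2, random_state=42
)

print(X_train.shape, X_test.shape)  # (40, 2) (10, 2)
```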

I understand that evaluation is performed when the predict function is defined, and the data is mapped into a designated directory in the project. However, I would still appreciate clarification on how and where the training and test splits occur in the provided code. Is this something I need to handle myself, or is it integrated within the pre-trained models?

Thank you for your time, and I look forward to your response!

Best regards,
Yosmery Gonzalez

When you’re using a pre-trained model, the model has already been trained, tested, and built. So it already contains the weights, and you can skip over those parts of the model building process.


A model is defined by its architecture together with its weights. When it comes to a custom dataset, you have to prepare the train / validation / test splits yourself. You can either prepare them ahead of time or create them dynamically as part of the training process.
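A minimal sketch of the “dynamic” option, done here with plain Python for clarity (in practice you might use sklearn or a tf.data pipeline instead): shuffle the example indices once with a fixed seed, then carve out the three slices at the start of training.

```python
import random

def split_indices(n, val_frac=0.1, test_frac=0.1, seed=0):
    """Shuffle example indices once, then slice off test/val/train partitions."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)  # seeded shuffle -> reproducible split
    n_test = int(n * test_frac)
    n_val = int(n * val_frac)
    test = idx[:n_test]
    val = idx[n_test:n_test + n_val]
    train = idx[n_test + n_val:]
    return train, val, test

train, val, test = split_indices(1000)
print(len(train), len(val), len(test))  # 800 100 100
```

Because the shuffle is seeded, every run of training sees the same partitions, which keeps the evaluation honest.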

As far as the assignment notebooks are concerned, please focus on the concepts rather than on the splits, since the notebooks are meant for learning purposes. See the lectures from Courses 2 and 3 on how to split data depending on the number of data points you start with, and apply those heuristics when building and evaluating your own models.
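The spirit of those lectures can be sketched roughly like this. The exact thresholds and ratios below are my own illustrative choices, not numbers from the course; the point is only that small datasets use the classic 60/20/20 split, while very large datasets need only tiny dev/test fractions:

```python
def suggest_split(n_examples):
    """Rough, illustrative split-ratio heuristic (thresholds are assumptions)."""
    if n_examples < 10_000:
        return (0.6, 0.2, 0.2)    # classic train/dev/test for small data
    elif n_examples < 1_000_000:
        return (0.9, 0.05, 0.05)  # medium data: most of it goes to training
    else:
        return (0.98, 0.01, 0.01) # huge data: a 1% dev set is already large

print(suggest_split(5_000))      # (0.6, 0.2, 0.2)
print(suggest_split(2_000_000))  # (0.98, 0.01, 0.01)
```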


Thank you so much for your answers 🙂