The professor recommended an approach regarding train/dev sets.
You train on your training set, try different model architectures, and evaluate on the dev set. We iterate on that to get a good model.
Does he mean to train different models on the same dataset and choose the one that does best?
I'm actually confused: how do we train one model and then try another architecture on the dev set? When we train one model, we get trained parameters for that specific architecture; can those parameters be used for another architecture? How is that possible?
He probably means that you train different models on the same dataset, test each of them on the dev set, and then move forward with optimization of the hyper-params using the model that performs best.
Yes, it is as Lucian says: the point is that you make a set of choices for all your "hyperparameters", then you train on the training set. Then you use that trained model to compute predictions on the dev set. If the results are not good (underfitting, overfitting), you adjust your hyperparameters and try the whole cycle again. "Rinse and repeat".
Then once you have good performance on the train and dev sets, then and only then do you evaluate that model on the test set. The point is that the test set can't be used in any of the previous training, so that you get a more accurate view of how your trained model will perform on "real world" inputs that it was not trained on (i.e. that it "has never seen before").
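Here is a minimal sketch of that loop, just to make the idea concrete. The dataset, the candidate architectures, and the hyperparameter values are all made up for illustration; the point is only that each candidate is trained from scratch on the training set, compared on the dev set, and the test set is touched exactly once at the very end.

```python
# Toy illustration of the train / dev / test workflow (not the course's code).
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

# Make a toy dataset and carve out train / dev / test splits.
X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_train, X_tmp, y_train, y_tmp = train_test_split(X, y, test_size=0.4, random_state=0)
X_dev, X_test, y_dev, y_test = train_test_split(X_tmp, y_tmp, test_size=0.5, random_state=0)

# Each architecture / hyperparameter choice gets its own freshly trained
# parameters; nothing is shared between candidates.
candidates = [
    {"hidden_layer_sizes": (16,), "alpha": 1e-4},
    {"hidden_layer_sizes": (64, 32), "alpha": 1e-3},
    {"hidden_layer_sizes": (128, 64), "alpha": 1e-2},
]

best_model, best_dev_acc = None, -1.0
for params in candidates:
    model = MLPClassifier(max_iter=500, random_state=0, **params)
    model.fit(X_train, y_train)          # train on the training set
    dev_acc = model.score(X_dev, y_dev)  # evaluate on the dev set
    if dev_acc > best_dev_acc:
        best_model, best_dev_acc = model, dev_acc

# Only the final chosen model is evaluated on the held-out test set.
print("best dev accuracy:", best_dev_acc)
print("test accuracy:", best_model.score(X_test, y_test))
```

So the trained parameters of one architecture are never reused by another; you simply retrain for each choice and let the dev set decide which choice to keep.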