Train/Dev/Test Distributions Lecture Clarification

Hi Sir,

@paulinpaloalto @lucaswener @Vasanth @carloshvp @ashimono @Sabs @manifest @fginac @Jendoubi

I'm not able to understand the phrase "consider important to do well on". What does it mean? Can you please help me understand?

"Choose a dev set and test set to reflect data you expect to get in the future and consider important to do well on."

I can understand the first part, "choose a dev set and test set to reflect data you expect to get in the future", but what does "consider important to do well on" mean, sir?

Hi,

When you are building the model, you ideally do not have access to the test data; you only have access to the dev (validation) set. So you choose the model that has the best accuracy/loss on the dev set. However, if the distribution of the dev set is very different from that of the test set, you would get very bad accuracy on the test set, which is something you do not want.

On the other hand, you might have built a dev set that matches the test set perfectly and then just left it at that. If you do not actually try to improve the score on the dev set, you end up with a low score on the dev set as well as the test set.

Ideally, you would want a good score on the dev set and the dev set should be a good representation of the test set.
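To make this concrete, here is a minimal sketch (the two data sources, the split sizes, and the hyperparameter grid are all made up for illustration, not from the lecture): the dev and test sets are both drawn from the distribution you expect to see and care about doing well on, the training set can mix in other data, and the model is picked purely by its dev-set score, with the test set held out until the end.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Stand-in data for two sources: lots of "web" examples, fewer "mobile"
# examples. The deployed app will see "mobile"-style data, so that is the
# data we "expect to get in the future and consider important to do well on".
web_X, web_y = rng.normal(0.0, 1.0, (10000, 5)), rng.integers(0, 2, 10000)
mobile_X, mobile_y = rng.normal(0.5, 1.0, (2000, 5)), rng.integers(0, 2, 2000)

# Dev and test sets both come from the "mobile" distribution;
# the training set can mix both sources.
n_dev = n_test = 500
dev_X, dev_y = mobile_X[:n_dev], mobile_y[:n_dev]
test_X, test_y = mobile_X[n_dev:n_dev + n_test], mobile_y[n_dev:n_dev + n_test]
train_X = np.vstack([web_X, mobile_X[n_dev + n_test:]])
train_y = np.concatenate([web_y, mobile_y[n_dev + n_test:]])

# Model selection uses only the dev set; the test set stays untouched until the end.
best_model, best_dev_acc = None, -1.0
for C in [0.01, 0.1, 1.0, 10.0]:             # candidate hyperparameters
    model = LogisticRegression(C=C).fit(train_X, train_y)
    dev_acc = model.score(dev_X, dev_y)      # pick by dev-set accuracy
    if dev_acc > best_dev_acc:
        best_model, best_dev_acc = model, dev_acc

print("chosen dev accuracy :", best_dev_acc)
print("final test accuracy :", best_model.score(test_X, test_y))
```

Because the dev and test sets come from the same (target) distribution, a model chosen for its dev score should also do reasonably well on the test set.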

Hope this helps!


Sir, as a conclusion, can we say that "consider important to do well on" means we need to get the best accuracy on the dev set, right?

Sometimes accuracy is not the best metric. But yes, you should choose the model weights that optimize the metric you care about!
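For example, here is a small made-up illustration (not from the lecture) of why accuracy can mislead on an imbalanced dev set, where a metric like F1 tells you more:

```python
import numpy as np
from sklearn.metrics import accuracy_score, f1_score

# Hypothetical imbalanced dev set: 95% negatives, 5% positives.
dev_y = np.array([0] * 95 + [1] * 5)

# A "model" that always predicts the majority class looks great on accuracy...
always_negative = np.zeros_like(dev_y)
print(accuracy_score(dev_y, always_negative))  # 0.95
print(f1_score(dev_y, always_negative))        # 0.0 -- it never finds a positive

# ...while a model that actually catches the positives has lower accuracy
# but a much more useful F1.
catches_positives = dev_y.copy()
catches_positives[:10] = 1                     # 10 false positives, all positives found
print(accuracy_score(dev_y, catches_positives))  # 0.90
print(f1_score(dev_y, catches_positives))        # 0.5
```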