Choosing a model based on training and validation errors

TanaySontakke · December 13, 2023, 5:00am

In the Optional Lab for Model Evaluation and Selection, the last part applies NNs to classification. It suggests that if two models have the same CV error (in this case NNs 2 and 3 in the workbook), we should choose the one with the lower training error, which makes sense.

But in this particular case, NN 2 is simpler than NN 3 (5 layers vs 6 layers). Both give the same CV error, and NN 3, being the more complex NN, unsurprisingly has the lower training error. So NN 2 and NN 3 fit the CV set equally well, but NN 2 is simpler.

So could a case also be made for choosing NN 2 rather than NN 3 because it’s simpler? In a real-world scenario, if two NNs give the same CV error but one is far simpler than the other, wouldn’t it be preferable to choose the simpler one (considering the computational resources required, etc.)?

rmwkwok · December 13, 2023, 6:48am

Hello @TanaySontakke,

I think you have made a very good point. I would choose NN2, which has 749 trainable parameters, fewer than NN3's 869. I will share this with the course team too.
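A rough sketch of that tie-break, in case it helps: pick the lowest CV error first, and among models tied on CV error prefer the one with fewer trainable parameters. The parameter counts for NN2 and NN3 are the ones quoted above; the NN1 entry, all error values, and the `pick_model` helper are hypothetical placeholders, not numbers or code from the lab.

```python
# Parameter counts for NN2 and NN3 are taken from this thread.
# NN1's count and every error value below are made-up placeholders.
candidates = [
    {"name": "NN1", "params": 500, "train_err": 0.10, "cv_err": 0.20},
    {"name": "NN2", "params": 749, "train_err": 0.06, "cv_err": 0.15},
    {"name": "NN3", "params": 869, "train_err": 0.04, "cv_err": 0.15},
]


def pick_model(models, tol=1e-9):
    """Return the model with the lowest CV error; break ties by size.

    `tol` is a hypothetical tolerance for treating two CV errors as equal.
    """
    best_cv = min(m["cv_err"] for m in models)
    # Keep every model whose CV error matches the best one (within tol).
    tied = [m for m in models if m["cv_err"] - best_cv <= tol]
    # Among the tied models, prefer the one with the fewest parameters.
    return min(tied, key=lambda m: m["params"])


chosen = pick_model(candidates)
print(chosen["name"])
```

Note that training error is deliberately not used in the tie-break here; a larger `tol` would also let a slightly worse but much smaller model win, which is a judgment call rather than a rule.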

Btw, that CV set has only 40 samples and uses a discrete-type error, so two NNs can easily end up with the same CV error.
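To illustrate why ties are likely, here is a small sketch that assumes the error is the fraction of misclassified samples (my reading of "discrete-type error"). With 40 samples the error can only move in steps of 1/40, so two models that make different mistakes can still score identically. The labels and predictions below are invented for the illustration.

```python
M_CV = 40  # CV set size mentioned above


def classification_error(y_true, y_pred):
    """Fraction of samples where the prediction differs from the label."""
    wrong = sum(1 for t, p in zip(y_true, y_pred) if t != p)
    return wrong / len(y_true)


# Every achievable error value is k / 40 for k = 0..40.
possible_errors = [k / M_CV for k in range(M_CV + 1)]
print(len(possible_errors))  # number of distinct achievable error values

# Invented labels and two invented sets of predictions that are wrong on
# different samples but on the same number of them.
y_true = [0] * M_CV
pred_a = list(y_true)
pred_b = list(y_true)
for i in (0, 1, 2):
    pred_a[i] = 1
for i in (10, 20, 30):
    pred_b[i] = 1

err_a = classification_error(y_true, pred_a)
err_b = classification_error(y_true, pred_b)
print(err_a == err_b)
```

With a larger CV set or a continuous loss (such as cross-entropy), exact ties like this become much less common.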

Cheers,
Raymond
