Why cross-validation?

It’s not quite clear to me why we use cross-validation instead of just testing all the models and picking the one with the lowest mean cost as our optimal model. Doing a whole cross-validation sequence just seems unnecessary.

  • Cross-validation provides a more robust estimate of a model’s performance than a single train-test split. It helps in evaluating how well the model generalizes to unseen data.
  • Helps to determine whether your model suffers from underfitting or overfitting.
  • Can assist in selecting the best model or hyperparameters for the task at hand (see the sketch after this list).
    If you check the optional videos on the skewed dataset [most probably in week 2], you will see the importance of cross-validation.
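
Here is a minimal sketch of that model-selection use case, assuming scikit-learn and a simple regression setup (the dataset, the polynomial-degree candidates, and the scoring metric are all just illustrative choices, not something from the course):

```python
# Compare candidate models with cross-validation instead of a single split.
from sklearn.datasets import make_regression
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

X, y = make_regression(n_samples=200, n_features=1, noise=10.0, random_state=0)

# Candidate models: polynomial regressions of increasing flexibility.
for degree in (1, 4, 15):
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    # 5-fold cross-validation; sklearn reports negative MSE for this scoring.
    scores = cross_val_score(model, X, y, cv=5, scoring="neg_mean_squared_error")
    print(f"degree={degree:2d}  mean CV MSE={-scores.mean():.1f}  std={scores.std():.1f}")
```

The mean cross-validated cost (and its spread across folds) gives a much more trustworthy basis for picking a model than the cost on one particular split.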

Pardon me if I’ve made any mistakes.
Thank you :smiley:

In a word: "overfitting".

Keep in mind that the goal is to get a model that makes good predictions on new data. The goal is not just getting a low cost on the training set - that has little value on its own, since we already have labels for the training set and don’t need predictions there at all.
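
To illustrate that point, here is a small sketch (again assuming scikit-learn; the synthetic data and the deliberately over-flexible model are just for demonstration) showing that a very low training cost tells you little about performance on new data:

```python
# A model can fit the training set almost perfectly yet predict poorly on new data.
from sklearn.datasets import make_regression
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

X, y = make_regression(n_samples=60, n_features=1, noise=15.0, random_state=1)
X_train, X_new, y_train, y_new = train_test_split(X, y, test_size=0.5, random_state=1)

# A very flexible model fit only on the training set.
model = make_pipeline(PolynomialFeatures(degree=20), LinearRegression())
model.fit(X_train, y_train)

print("training MSE:", mean_squared_error(y_train, model.predict(X_train)))
print("new-data MSE:", mean_squared_error(y_new, model.predict(X_new)))
# Typically the training MSE is far lower than the new-data MSE here,
# which is exactly why low training cost alone is not the goal.
```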

In addition to @ahs95's great reply:

In that case you run quite a risk of survivorship bias. With cross-validation you can test several different splits and more variation of the data, which usually helps prevent overfitting compared to the scenario you outlined (see the sketch below). Hope that helps, @Jules_Gransden.
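
For intuition, here is a hedged sketch (scikit-learn assumed; the classifier and dataset are arbitrary placeholders) of how k-fold cross-validation evaluates the model on several different splits of the same data:

```python
# Each fold trains on a different subset and validates on the held-out part,
# so the performance estimate is not tied to one lucky (or unlucky) split.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import KFold

X, y = make_classification(n_samples=150, n_features=5, random_state=0)
kf = KFold(n_splits=5, shuffle=True, random_state=0)

for fold, (train_idx, val_idx) in enumerate(kf.split(X), start=1):
    model = LogisticRegression(max_iter=1000)
    model.fit(X[train_idx], y[train_idx])
    acc = model.score(X[val_idx], y[val_idx])
    print(f"fold {fold}: validation accuracy = {acc:.3f}")
# Every example lands in a validation fold exactly once.
```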

Best regards
Christian

Here are two threads I would recommend taking a look at:

Best regards
Christian