How to ensure your models are not overfitting?

How can I know that the model is improving and generalizing, and not just overfitting on the data?


Hi @guandaline! :wave:

You should test it on properly constructed validation and test sets. The validation set comes from the same distribution as the training data, and you should evaluate on it continuously while training. The moment your validation metrics start to worsen while the training metrics keep improving is the moment you start overfitting. The validation set should also be stratified and large enough; in other words, make sure it represents the classes present in the training set well.
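To make that concrete, here is a minimal sketch in Python with scikit-learn (the toy data from `make_classification` and all hyperparameters are just placeholders): a stratified validation split, plus stopping once the validation loss stops improving.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import SGDClassifier
from sklearn.metrics import log_loss
from sklearn.model_selection import train_test_split

# Toy data standing in for your dataset.
X, y = make_classification(n_samples=5_000, n_classes=3,
                           n_informative=8, random_state=0)

# Stratified split: the validation set keeps the training class proportions.
X_train, X_val, y_train, y_val = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=0
)

model = SGDClassifier(loss="log_loss", random_state=0)
best_val, patience, bad_epochs = np.inf, 5, 0

for epoch in range(200):
    # One pass of SGD over the training data.
    model.partial_fit(X_train, y_train, classes=np.unique(y))
    val = log_loss(y_val, model.predict_proba(X_val))
    if val < best_val:
        best_val, bad_epochs = val, 0
    else:
        bad_epochs += 1          # validation metric worsening
    if bad_epochs >= patience:   # stop before the model overfits further
        print(f"Stopping at epoch {epoch}: validation loss stopped improving")
        break
```

The `patience` counter is plain early stopping: training continues only while the validation metric keeps improving, which is exactly the signal described above.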

When you’ve trained your model and are happy with your validation metrics, you should evaluate it on the test set. A properly constructed test set resembles the real-world data (including the class distribution) as closely as possible. Ideally, the data points in your test set should come from a different source than the train & val sets.
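As a final step, run a single evaluation on that held-out test set. A sketch, assuming `X_test` / `y_test` were collected from a separate source (the names are illustrative) and `model` is the trained model from above:

```python
from sklearn.metrics import classification_report

# X_test / y_test: held-out data from a separate source, never used
# during training or validation.
print(classification_report(y_test, model.predict(X_test)))
```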

There are two good Twitter threads related to your question:


Hi @guandaline !

I also recommend having a look at the free ebook (Machine Learning Yearning) on the DeepLearning.AI website.

The website link is: Resources - DeepLearning.AI

I gained a better understanding of training & tuning an ML model after reading this book.

Hope you will enjoy it too! :smiling_face_with_three_hearts:
