How to choose a model to deploy

peachans · December 30, 2023, 6:16pm

Let’s say I have run cross-validation for two models, linear regression and random forest. When choosing which model to deploy in production, should I pick it based on the k-fold cross-validation score, or should I evaluate both models on the test set and pick the one that performs best there? My initial understanding was that after cross-validation narrows things down to a few good models, we choose among them again using the test set. But in many Kaggle competitions, most participants, including the winners, seem to choose their model based on the cross-validation score.

TMosh · December 30, 2023, 6:32pm

In my experience, k-fold cross-validation is often used when the data set is too small to split into separate training, validation, and test sets that are each large enough to give statistically reliable results.

Remember that Kaggle hosts contests that need a well-defined numerical ranking among the competitors. A contest has different goals than you would face in a real-world solution.

rmwkwok · December 30, 2023, 8:17pm

Hello @peachans,

As explained in the course, we use the cross-validation score to pick the best model.
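
To make that concrete, here is a minimal sketch of the workflow, not a definitive recipe. It assumes scikit-learn, a synthetic dataset from `make_regression`, mean squared error as the metric, and 5 folds; none of those choices come from the question, and the names `candidates`, `cv_mse`, and `best_name` are just illustrative. The two candidates are compared by k-fold cross-validation on the training data only, and the test set is touched once at the end, to estimate how the chosen model might generalize rather than to choose between models.

```python
from sklearn.datasets import make_regression
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import KFold, cross_val_score, train_test_split

# Synthetic data stands in for a real dataset (assumption for illustration).
X, y = make_regression(n_samples=300, n_features=5, noise=10.0, random_state=0)

# Hold out a test set that plays no part in model selection.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

# The two candidate models from the question.
candidates = {
    "linear_regression": LinearRegression(),
    "random_forest": RandomForestRegressor(n_estimators=50, random_state=0),
}

# Score each candidate with 5-fold cross-validation on the training data.
cv = KFold(n_splits=5, shuffle=True, random_state=0)
cv_mse = {}
for name, model in candidates.items():
    scores = cross_val_score(
        model, X_train, y_train, cv=cv, scoring="neg_mean_squared_error"
    )
    cv_mse[name] = -scores.mean()  # flip the sign back to a plain MSE

# Pick the candidate with the lowest cross-validated error.
best_name = min(cv_mse, key=cv_mse.get)

# Refit the chosen model on all training data, then use the test set once
# to report an estimate of generalization error.
best_model = candidates[best_name].fit(X_train, y_train)
test_mse = mean_squared_error(y_test, best_model.predict(X_test))

print("CV MSE per model:", cv_mse)
print("Selected:", best_name, "| test MSE:", test_mse)
```

If both models were instead compared on the test set and the winner kept, the reported test error would tend to be optimistic, because the test set would have taken part in the selection.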

Cheers,
Raymond
