What dataset in the final deployment we should use?

yildirimga · January 11, 2026, 7:39am

Hi All,

In machine learning specializaiton, I am on Advance Algorithms / Week 3 –Advice for applying machine learning.

Using cross validation is a nice trick to pick a model, my question is about deployment stage.

What is the better practice in deployment: (1) use train set, check error/accuracy on cross validation set to pick a model, and then use test set to report train, and immedately deploy this to production or (2) after train-validation-test done and retrain the picked model for whole dataset including training/crossvalidaiton/test and deploy the new parameters to production?

Thanks,

TMosh · January 11, 2026, 8:08am

Never use the test set in training. It’s your independent verification that the model works well enough to meet your goals.

Topic		Replies	Views
Retrain model on the whole data set (include test set) when deploying a model Advanced Learning Algorithms week-module-3	2	424	November 6, 2023
How to choose model to deploy Advanced Learning Algorithms	2	286	December 30, 2023
Questions about automatically choosing model Advanced Learning Algorithms week-module-3	5	363	August 31, 2023
Model Selection based on CV or Test & Diff b/w CV and Test data Advanced Learning Algorithms week-module-3	17	550	December 6, 2023
About dev and test sets Advanced Learning Algorithms week-module-3	3	537	March 14, 2023

What dataset in the final deployment we should use?

Related topics