Why do we split dataset before imputation?

_22Ai · August 23, 2023, 2:05am

I’m wondering why do we split into train/test set before imputation.
Is there any downside when we do imputation before train/test split?

TMosh · August 23, 2023, 2:17am

Yes, the downside is overfitting during training. This results in a system that only works well on the training set, but cannot make useful future predictions.

Topic		Replies	Views
Why we split the our data into training and testinb/dev sets? Supervised ML: Regression and Classification week-1	3	369	August 24, 2023
Normalisation/feature scaling Advanced Learning Algorithms week-2	1	500	July 4, 2022
Train_dev_test split doubt Structuring Machine Learning Projects coursera-platform	2	540	September 21, 2022
Question on Train/Test Split and Data Handling in Pre-Trained Models for NMT Assignment Deep Learning Resources ai-discussions , project , coursera-platform	3	22	December 4, 2024
Smaller splits to compare and infer properties of model variants Improving Deep Neural Networks: Hyperparameter tun coursera-platform	2	340	October 2, 2023

Why do we split dataset before imputation?

Related topics