Normalization before a new prediction?

Pierre_BEJIAN · August 26, 2022, 3:50pm

Hi,
I am currently working on a multivariate linear regression.
I’m a bit confused about the normalization (mean=0 and standard deviation=1). If we do normalization on our training set, shouldn’t we also normalize our cross-validation set as well as all the new values of X that the model does not know?

I have the same question for image classification with a CNN. If we do pre-processing on our training images, shouldn’t we do the same pre-processing on a new image that the model does not know?

Regards,

Pierre

SamReiswig · August 26, 2022, 8:53pm

Hi!

Yes, every transformation of the data that you do in training should be done on the new data as well.

alvaroramajo · August 27, 2022, 12:51pm

Hi, @Pierre_BEJIAN !

Just one more thing to add to the post. You should normalize your test data with your training data mean and std dev. In a real world scenario you don’t know how your input data is distributed, so you cannot rescale it with any other values than the ones from your training.

Pierre_BEJIAN · August 27, 2022, 1:34pm

You are totally right! I had guessed it but it’s good to have confirmation.
Thanks a lot

Topic		Replies	Views
C1_W2_Lab03_Feature_Scaling_and_Learning_Rate_Soln - normalizing the testing data Supervised ML: Regression and Classification week-2	6	518	July 14, 2022
Question about feature scaling Supervised ML: Regression and Classification week-2	5	34	August 17, 2024
Can someone help explain this line Supervised ML: Regression and Classification week-2	8	430	July 27, 2023
How to implement the feature scaling in prediction? Supervised ML: Regression and Classification week-2	1	524	June 23, 2022
Question about rescaling Supervised ML: Regression and Classification week-2	4	504	July 7, 2022

Normalization before a new prediction?

Related topics