Hi!
In the course it is emphasized that the features of the input X should be normalized before training a NN model. The explanations given are clear to me. But I'm wondering: what about the response variable Y, for example in non-classification problems such as regression?
My intuition is that the target Y should be left unchanged. But one example I have in mind is when Y can be very large compared to the normalized features in X - could this be a problem for a NN?
I think that is a very good point. I do believe that having a target variable on a totally different scale makes the network harder to train.
To add my two cents,
In practice, I simply end up scaling the target. Note that this is a bit different from normalizing. Rescaling means the values are mapped into a given range, for example the same range as the normalized features. Normalizing can entail a certain degree of change in the distribution, which is not always desirable: you want to make sure your target variable still retains the distribution of the original data.
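To make that concrete, here is a minimal sketch of what I mean, using scikit-learn's MinMaxScaler on the target. The data and the MLPRegressor model are just placeholders I made up for illustration, not anything from the course:

```python
import numpy as np
from sklearn.preprocessing import MinMaxScaler
from sklearn.neural_network import MLPRegressor

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))  # features, already on a comparable scale
# A deliberately large-valued target, to mimic the situation in the question
y = 1e6 * (X @ np.array([0.5, -1.0, 2.0]) + rng.normal(scale=0.1, size=200))

# Rescale y into [0, 1]; min-max scaling only shifts and stretches the
# values, so the shape of the distribution is preserved.
y_scaler = MinMaxScaler()
y_scaled = y_scaler.fit_transform(y.reshape(-1, 1)).ravel()

model = MLPRegressor(hidden_layer_sizes=(16,), max_iter=2000, random_state=0)
model.fit(X, y_scaled)

# Predictions come out in the scaled range; invert to recover original units.
y_pred = y_scaler.inverse_transform(model.predict(X).reshape(-1, 1)).ravel()
```

The key point is the last line: because the scaler stores the min and max, you can always map predictions back to the original units.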
Thanks for the additional input. I didn't go into the particulars much.
Scaling and normalization each have their advantages and disadvantages. I believe that with min-max scaling, which is also a normalization technique, you can revert back to the original values. But as you warned, there are methods like L2 normalization that don't make that possible.
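A small sketch of that invertibility point, with made-up values and plain numpy:

```python
import numpy as np

y = np.array([3.0, 10.0, 25.0, 50.0])

# Min-max scaling is an affine map, so it can be undone exactly
# as long as you keep the min and max around.
y_min, y_max = y.min(), y.max()
y_scaled = (y - y_min) / (y_max - y_min)
y_recovered = y_scaled * (y_max - y_min) + y_min
assert np.allclose(y_recovered, y)  # original values come back

# L2 normalization divides by the vector's norm. Once the norm is
# discarded, the original magnitudes cannot be recovered from the
# normalized values alone.
norm = np.linalg.norm(y)
y_l2 = y / norm  # unit-length vector; the scale information is gone
```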