Feature Scaling - What method to choose?

vvsvictor · August 18, 2023, 10:05am

I’m not sure which scaling method I should use. In which cases is it better to divide by the max, use mean normalization, or use z-score normalization?

TMosh · August 18, 2023, 3:56pm

This is totally up to you.
Primarily it depends on how you want to handle outliers in the data set, and its overall statistics.

Using the range (max - min) will put a lot of emphasis on outliers.
Using the standard deviation will reduce the impact of outliers, although it does not remove it.
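
To make the range vs. standard deviation point concrete, here is a minimal sketch using only the Python standard library. The data set and the helper name `scale_factors` are made up for illustration. It compares how much each denominator grows when a single outlier is added; it illustrates one case and is not a general proof.

```python
import statistics

def scale_factors(values):
    """Return the two candidate denominators: range (max - min) and population std."""
    return max(values) - min(values), statistics.pstdev(values)

# Hypothetical feature: twenty evenly spread values, then the same values plus one outlier.
typical = [float(x) for x in range(1, 21)]
with_outlier = typical + [100.0]

range_before, std_before = scale_factors(typical)
range_after, std_after = scale_factors(with_outlier)

# How much each denominator grows once the outlier is included.
range_inflation = range_after / range_before
std_inflation = std_after / std_before

print(f"range grows by a factor of {range_inflation:.2f}")
print(f"std grows by a factor of {std_inflation:.2f}")
```

In this example both denominators grow, but the range grows more, so range-based scaling squeezes the typical values into a narrower band than scaling by the standard deviation does. With very few data points the difference can shrink or disappear.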

Deepti_Prasad · August 18, 2023, 4:17pm

Hello victor,

Z-score normalization, or standardisation, rescales each feature so that it has a mean of 0 and a standard deviation of 1, like a standard Gaussian distribution. It does not change the shape of the distribution itself.

If your distribution is not Gaussian, or the standard deviation is very small, then mean normalization works better. Mean normalization basically fixes the range of the data to within -1 and 1, while the closely related min-max scaling fixes it to between 0 and 1.
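
For reference, the three methods from the original question can be sketched as below. This is a rough sketch in plain Python; the function names and the sample values are illustrative assumptions, and the population standard deviation is used for the z-score.

```python
import statistics

def divide_by_max(values):
    """x / max(x): maps non-negative data into [0, 1]."""
    peak = max(values)
    return [x / peak for x in values]

def mean_normalization(values):
    """(x - mean) / (max - min): centred on 0, within [-1, 1]."""
    mu = statistics.fmean(values)
    spread = max(values) - min(values)
    return [(x - mu) / spread for x in values]

def z_score_normalization(values):
    """(x - mean) / std: result has mean 0 and standard deviation 1."""
    mu = statistics.fmean(values)
    sigma = statistics.pstdev(values)
    return [(x - mu) / sigma for x in values]

# Hypothetical feature values (for example, house sizes in square feet).
sizes = [852.0, 1416.0, 1534.0, 2104.0]

by_max = divide_by_max(sizes)
mean_norm = mean_normalization(sizes)
z = z_score_normalization(sizes)

print(by_max)
print(mean_norm)
print(z)
```

Note that the sketch does not guard against a constant feature (zero range or zero standard deviation), which would cause a division by zero.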

Disadvantages: normalizing by the range is sensitive to outliers, so if there are outliers in your dataset, mean normalization is not preferred. That is when standardisation (z-score normalization) can be used instead.

Outliers are observations that do not fit the rest of the data: they have extreme values, far from where most of the data lies.

Regards
DP

TMosh · August 18, 2023, 5:00pm

Note that the outliers may be the most important data in the model, so be careful how they are considered.
