Rescaling methods for outlier data

Yuri_Dulkin · July 2, 2022, 7:19am

Hi all,

I just finished the rescaling lesson, and I wanted to ask if there are ways to tackle data sets that contain data that isn’t neatly distributed?

If I were to use these rescaling methods with data that has outlier tails or is distributed in a non-normal fashion, It would be challenging to keep that rescaled data in a confined range.

I guess there are methods for normalizing data (not just rescaling it)?
Would we encounter them somewhere during the specialization?

Thanks for your help,
Yuri.

TMosh · July 2, 2022, 7:56am

In practice we don’t really need a strictly confined range. Anything that gets the features into a zero-mean and a range of less than an order of magnitude will work fine (say between maybe -3 and +3).

Scaling by 1/ the standard deviation is a good choice - or if you know your data has a different distribution, you can use that instead.

Topic		Replies	Views
Question about rescaling Supervised ML: Regression and Classification week-2	4	504	July 7, 2022
Feature Scaling - What method to choose? Supervised ML: Regression and Classification week-2	3	396	August 18, 2023
Question: Why we don't use standardization in feature scaling? Supervised ML: Regression and Classification week-3	3	490	July 24, 2022
C1_W2 - Feature Scaling for Unbounded Data Points Supervised ML: Regression and Classification week-2	2	512	August 17, 2022
Mean Normalization VS other forms of Feature Scaling Supervised ML: Regression and Classification week-2	2	530	July 28, 2022

Rescaling methods for outlier data

Related topics