Hi @mvrbiguv,
This post discusses the relationship between the learning rate, feature scales, and the regularity of the cost contours. You will also notice that the discussion is based on a lecture slide, so you might review the lectures again for more explanation.
The key to feature scaling is for all features to span a similar range, not the “best” range and not a small range. Usually people apply one of the first three methods in this Wikipedia section to all features for the job (sketched below). Those three methods all result in a “small” range around zero, but being small is not the key; having a similar range across all scaled features is the key.
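In case it helps, here is a minimal NumPy sketch of what I mean, assuming the section you linked lists min-max rescaling, mean normalization, and z-score standardization as its first three methods (the sample values below are made up for illustration):

```python
import numpy as np

def min_max_rescale(X):
    # Rescaling (min-max normalization): maps each column to [0, 1]
    return (X - X.min(axis=0)) / (X.max(axis=0) - X.min(axis=0))

def mean_normalize(X):
    # Mean normalization: centers each column at 0, range roughly [-1, 1]
    return (X - X.mean(axis=0)) / (X.max(axis=0) - X.min(axis=0))

def standardize(X):
    # Standardization (z-score): zero mean, unit standard deviation per column
    return (X - X.mean(axis=0)) / X.std(axis=0)

# Two features on very different scales: e.g. house size and bedroom count
X = np.array([[2104., 3.],
              [1600., 3.],
              [2400., 4.],
              [1416., 2.]])

for f in (min_max_rescale, mean_normalize, standardize):
    Xs = f(X)
    print(f.__name__, "-> per-column min:", Xs.min(axis=0), "max:", Xs.max(axis=0))
```

Notice that the three methods give different numbers, but in every case the two columns end up spanning a similar range, which is the property that matters.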
If you scale all features to a similar range, you can achieve a more regular cost contour (as exemplified in the lecture slide quoted in the linked post). If you exempt some features from scaling, you take the risk of having a less regular contour.
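To make that risk concrete, here is a hedged sketch (plain batch gradient descent on an invented linear regression problem, so all numbers are for illustration only) showing that unscaled features force a tiny learning rate while standardized features tolerate a much larger one:

```python
import numpy as np

rng = np.random.default_rng(0)

# Made-up data: feature 0 spans roughly [0, 2000], feature 1 roughly [0, 5]
X = np.column_stack([rng.uniform(0, 2000, 100), rng.uniform(0, 5, 100)])
y = 0.003 * X[:, 0] + 1.5 * X[:, 1] + rng.normal(0, 0.1, 100)

def final_cost(X, y, lr, steps=1000):
    # Batch gradient descent on mean squared error, with a bias column
    Xb = np.column_stack([np.ones(len(X)), X])
    w = np.zeros(Xb.shape[1])
    for _ in range(steps):
        grad = 2 * Xb.T @ (Xb @ w - y) / len(y)
        w -= lr * grad
    return np.mean((Xb @ w - y) ** 2)

# Unscaled: a learning rate above roughly 1e-6 makes the cost blow up here,
# and even a stable one crawls because the contours are long and narrow
print("unscaled, lr=5e-7:", final_cost(X, y, 5e-7))

# Standardized: both features span a similar range, so a much larger
# learning rate converges quickly toward the noise floor
Xs = (X - X.mean(axis=0)) / X.std(axis=0)
print("scaled,   lr=0.1: ", final_cost(Xs, y, 0.1))
```

If you also plot the cost per iteration for a few learning rates, you will see the contour story from the lecture slide play out directly.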
I recommend trying to answer your own questions by doing some real experiments on different datasets; the sketches above could be a starting point. That will give you a more concrete idea, and you will see in practice what you are trading off when you don't scale each and every feature with one of the methods from that Wikipedia section.
Raymond