Hi all,
I would like to ask a question regarding feature scaling, I know there are many methods to do it, and in the course only 2 are mentioned
1 - xj / Max(X) || Result will be between 0 and 1
2 - (xj−μj) / (xj range) || Result will be between −1 and 1
But both methods are depending on max and range (max - min) so so they might be affected by outliers of the data, and as far as I read, the best way to avoid outliers in feature scaling is to use Standardization
3 - (xj - μj) / stdv(X) || Result: no specific range
So I need to know why this method was not mentioned, is that related its’ result? should the result be in specific range in order to avoid any issue in the models? and if yes, what are the potential issues that might happen?
Thanks for your time and great effort