How does regularisation on a polynomial regression model help to selectively shrink the less significant parameters more in comparison to the more significant ones?

Nityunj_Goel · January 28, 2024, 8:07am

When Andrew Ng explained regularization with the example in which he added 1000w3² + 1000w4² to the loss function, it was clear how this selectively penalises the parameters of the higher-order terms and therefore reduces overfitting. But in practice, we regularise all the parameters using the same constant lambda. I understand the impact on the model when lambda = 0 or when lambda is very high, but I can't see how an intermediate value would selectively penalise the higher-order terms and not the lower-order ones, the way it did in the example.

TMosh · January 28, 2024, 8:48am

It doesn’t selectively penalize higher order terms. It regularizes all of them.

The example used in the lecture is a rather misleading worst-case situation.
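To make the "regularizes all of them" point concrete, here is a minimal sketch, not anything from the course labs. It assumes the usual regularized cost, (1/2m)·sum of squared errors + (lambda/2m)·sum of w_j², where the same lambda multiplies every w_j². The toy data, the degree-4 features, and the helper name `ridge_weights` are all made up for illustration. Any difference in how far individual weights end up shrinking comes from the data-fit term, not from the penalty treating some weights specially.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy data: a quadratic trend plus noise.
x = np.linspace(-1.0, 1.0, 30)
y = 1.0 + 2.0 * x - 1.5 * x**2 + rng.normal(scale=0.2, size=x.size)

# Degree-4 polynomial features x, x^2, x^3, x^4, standardized per column.
X = np.column_stack([x**d for d in range(1, 5)])
X = (X - X.mean(axis=0)) / X.std(axis=0)
y_centered = y - y.mean()  # centering stands in for the unregularized bias b


def ridge_weights(X, y, lam):
    """Closed-form minimizer of (1/2m)*||Xw - y||^2 + (lam/2m)*||w||^2."""
    n_features = X.shape[1]
    # Setting the gradient to zero gives (X^T X + lam * I) w = X^T y.
    # lam * I adds the same amount to every weight's diagonal entry.
    return np.linalg.solve(X.T @ X + lam * np.eye(n_features), X.T @ y)


lambdas = [0.0, 1.0, 10.0, 100.0]
weights = [ridge_weights(X, y_centered, lam) for lam in lambdas]
norms = [float(np.linalg.norm(w)) for w in weights]

for lam, w, norm in zip(lambdas, weights, norms):
    print(f"lambda={lam:>6}: w={np.round(w, 3)}  ||w||={norm:.3f}")
```

Running this prints one weight vector per lambda. The overall size of the weight vector goes down as lambda goes up, and no term is singled out by the penalty itself.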
