Why does regularization reduce w?

saifkhanengr · May 9, 2023, 3:04pm

First, understand why we need the regularization term. When our model overfits, a good job at training data but a poor job at test, it means our model does not generalize well. So, we need to penalize it. Penalization means we need to increase the cost (J). So, when we add that extra regularization factor, it means we are trying to increase the cost. However, at the same time, we also try to decrease the cost by optimization (gradient descent).

So, gradient descent will try to reduce the cost while regularization term will try to increase the cost. That is a clash, right? The more the lambda value, the more the cost will be. But gradient descent is trying hard to minimize the cost, so it will reduce the value of parameters (W).

Best,
Saif.

Topic		Replies	Views
Will Lambda reduce the size of the w parameters? Supervised ML: Regression and Classification week-module-3	7	497	May 6, 2023
Question on how Lambda works Supervised ML: Regression and Classification week-module-3	9	506	February 22, 2023
Why we need to add regularization lambda function into the cost function since we already did the regularization in the gradient descent Supervised ML: Regression and Classification week-module-3	5	457	January 15, 2024
Explanation of Lambda in Regularization of Linear Regression Cost Function Supervised ML: Regression and Classification week-module-3	2	132	July 21, 2024
Questions on regularization Improving Deep Neural Networks: Hyperparameter tun coursera-platform	2	470	July 17, 2023

Why does regularization reduce w?

Related topics