In the video Why Regularization Reduces Overfitting, it mentions that a bigger \lambda leads to smaller W. Why is that so?
I suppose what Andrew is talking about is updating the weights by back-propagation with the partial derivatives, dw = \frac{\partial J}{\partial w^{l}}.
If we add the L2 regularization term with \lambda, dw also contains the term \frac{\lambda}{m}w^{l}. Updating the weights by back-propagation means “subtracting” \alpha \, dw from the previous weights, i.e. w^{l} := w^{l} - \alpha\left(\frac{\partial J}{\partial w^{l}} + \frac{\lambda}{m}w^{l}\right) = \left(1 - \frac{\alpha\lambda}{m}\right)w^{l} - \alpha\frac{\partial J}{\partial w^{l}}. So, if we set a bigger \lambda, each update shrinks w^{l} by a larger factor, and the weights are driven toward smaller values.
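Here is a minimal numerical sketch of that effect (not the course's assignment code): gradient descent on a single linear layer, once without regularization and once with a large \lambda. The data, learning rate, and iteration count are made-up values just for illustration.

```python
import numpy as np

np.random.seed(0)

m = 200                                        # number of examples
X = np.random.randn(3, m)                      # features, shape (n_x, m)
Y = np.array([[1.0, -2.0, 3.0]]) @ X + 0.1 * np.random.randn(1, m)  # noisy targets

def train(lambd, alpha=0.1, iterations=500):
    W = np.random.RandomState(1).randn(1, 3)   # same init for both runs
    b = 0.0
    for _ in range(iterations):
        A = W @ X + b                          # forward pass (linear model)
        dZ = (A - Y) / m                       # gradient of the unregularized cost
        dW = dZ @ X.T + (lambd / m) * W        # L2 adds the (lambda/m) * W term
        db = np.sum(dZ)
        W -= alpha * dW                        # "subtract" dW, scaled by alpha
        b -= alpha * db
    return W

print("||W|| with lambda = 0   :", np.linalg.norm(train(0.0)))
print("||W|| with lambda = 100 :", np.linalg.norm(train(100.0)))
```

With \lambda = 100 the weight norm comes out noticeably smaller, because every update multiplies W by the factor (1 - \alpha\lambda/m) before applying the data gradient.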