Need clarification

Praveen_Titus_F · November 28, 2022, 6:05pm

Hi there, I ve two doubts

Q1) Is it mandatory to use regularization in all regression algorithm cost functions or only when there is overfit in training?

Q2) Is regularization in cost function used for each “d” in (d = 1, 2,…,10), I mean when training each order of polynomial.

Thanks in advance !!!

Juan_Olano · November 28, 2022, 6:36pm

Hi @Praveen_Titus_F ,

Is it mandatory to use regularization? strictly speaking, no, it is not ‘mandatory’ per se. You can decide to use it or not. Now, if your NN is overfitting, then regularization is one technique to cure the NN from overfitting.

What happens if the NN doesn’t have overfitting and we still use regularization? Usually nothing bad will happen. Your NN will be trained and it may even help to get a better generalization.

You’ll find 3 types of regularization: L1, L2, and Dropout.

L2 is probably the most common one. In this type, the loss function is extended by a term that penalizes the sum of squares of the weights (aka weight decay).

L1 also extends the loss function with a term that penalizes the sum of absolute values of the weights.

Dropout acts differently: it randomly drops units (neurons) from the layer. These dropped units will not have any impact in the model’s performance while training.

Check out these 3 types of regularization in lessons or in google - it is important to understand them very well.

Thanks,

Juan

rmwkwok · November 29, 2022, 2:48am

Hello @Praveen_Titus_F,

I want to focus on your Q2. The answer is yes, but instead of saying “for each d”, I would like to introduce that we would want to say “for each weight”, which are the w_1, w_2, … that we stick next to each of the (polynomial) features. Because the number of weights can be larger than the number of “d” when we include cross terms such as x_1x_2, and also because regularization terms are getting those weights involved directly, such as for d = 1, the L2 regularization can be expressed as \lambda w_1^2.

Therefore, yes, regularization is applied on all the weights or all the w's.

Cheers,
Raymond

Praveen_Titus_F · November 30, 2022, 3:34pm

Thank you @Juan_Olano and @rmwkwok !!!

Topic		Replies	Views
Why we need to add regularization lambda function into the cost function since we already did the regularization in the gradient descent Supervised ML: Regression and Classification week-3	5	452	January 15, 2024
Advice needed for neural network with regularization Advanced Learning Algorithms week-3	3	377	August 22, 2023
Does Regularization affects all weights equally? Supervised ML: Regression and Classification week-3	2	434	May 28, 2023
Video: Cost function with regularization Advanced Learning Algorithms week-1	3	513	June 3, 2023
Regularization Intution Supervised ML: Regression and Classification week-3	1	422	June 6, 2023

Need clarification

Related topics