Week 3 Gradient Penalty

Richeek_Arya · July 1, 2023, 3:39pm

It was glossed over in the lecture that gradient penalty term is required to enforce 1-L continuity. Moreover any gradient other than 1 is penalised.

Can somebody explain how come adding gradient penalty help achieving 1-L continuity?

Nithin_Skantha_M · July 2, 2023, 10:18am

Hi Richeek,
Welcome to the community !

Take a look at this attachment :

As you can see here, the regularization term or the gradient penalty is a positive number that is being added to the loss function, so when you are trying to minimize the loss function, you are also trying to get the regularization term close to zero such that it doesn’t have a big say in the loss and to bring it to that point eventually means that you are indirectly trying to bring the norm closer to 1 , thus indirectly enforcing the constraint. Also, it is better compared to weight clipping due to stability during the training.

Hope you get the point, if not feel free to post your queries.
Regards,
Nithin

Richeek_Arya · July 2, 2023, 5:31pm

What about when gradient is less than 1 (following L-1 continuity)? Why the critic’s loss function want it to be exactly 1?

Nithin_Skantha_M · July 3, 2023, 11:24am

It doesn’t want it to be exactly 1 but much closer to 1, say it wants it to be in this region 1-alpha to 1+alpha where alpha is a very small number.
1-L continuity asks me to keep the norm to be at most 1 at all points, it can be less than 1 but by keeping it near to 1, I’m still enforcing the constraint right.
Also, we have to consider model training too, we have to think of a way to provide stable training + following the constraints and thus it is implemented this way.

Richeek_Arya · July 8, 2023, 6:55am

I see thanks for your reply!

Topic		Replies	Views
Confused about 1-L continuity condition in W-GAN Build Basic Generative Adversarial Networks week-3	1	165	June 4, 2024
Why add regularization term to W-Loss function? Build Basic Generative Adversarial Networks week-3	1	150	May 11, 2024
1-Lipschitz continuous Build Basic Generative Adversarial Networks week-3	1	326	December 7, 2022
Doubt in gradient penalty image interpolation Build Basic Generative Adversarial Networks week-3	7	359	January 24, 2022
Back-propagation with gradient penalty Build Basic Generative Adversarial Networks week-3	1	362	January 9, 2022

Week 3 Gradient Penalty

Related topics