Why are we adding the regularization term to the W-Loss function?
I understand that the critic wants to maximize W-Loss and that we enforce 1-Lipschitz (1-L) continuity by adding a regularization term, which gets bigger if the norm of the critic's gradient is bigger than 1.
But if the norm of the critic's gradient is bigger than 1, doesn't that term actually increase the loss function, thus benefiting the critic? If we want to penalize it, shouldn't we subtract it instead?
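For concreteness, this is the critic objective I have in mind, written as a minimization the way the WGAN-GP paper (Gulrajani et al.) states it, with λ the penalty weight and x̂ the interpolated samples (my notation may differ from the lecture):

$$
L = \mathbb{E}_{\tilde{x}}\big[C(\tilde{x})\big] - \mathbb{E}_{x}\big[C(x)\big] + \lambda\,\mathbb{E}_{\hat{x}}\Big[\big(\lVert \nabla_{\hat{x}} C(\hat{x}) \rVert_2 - 1\big)^2\Big]
$$

where $\tilde{x} = G(z)$ are generated samples. My question is basically whether the "+" in front of the penalty only makes sense in this minimization form, or whether it also belongs in the maximization view I described above.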