In the Week 3 video lecture named '1-Lipschitz Continuity Enforcement', it's mentioned that the formula for the WGAN-GP loss is:

min_g max_c E[c(x)] - E[c(g(z))] + lambda * E[(||grad(c(x_hat))||_2 - 1)^2]

Here the first term is the expectation of the critic's scores on real images, and the second term is the expectation of the critic's scores on fake (generated) images.

Whereas in the Week 3 assignment for WGAN, the working code for calculating the critic's loss is:

crit_loss = torch.mean(crit_fake_pred) - torch.mean(crit_real_pred) + c_lambda*gp

In the code, however, the first term is for the fake generated images.

This is a bit confusing or am I missing something? Please help me with this issue.

Thanks a lot in advance,

Fellow Coursemate

Hi Ajeesh_Ajayan!

Welcome to the community

From the equation, you can infer that, for the critic, the loss is calculated by maximizing the distance between the critic’s predictions on the real images and the predictions on the fake images while also adding a gradient penalty.

In the programming assignment, we calculate the negative of this distance. Why? In an implementation, back-prop minimizes a loss function, but our requirement here is to maximize the distance (the critic's objective). Maximizing a quantity is equivalent to minimizing its negative: pushing a value up from 5 to 7 is the same as pushing its negative down from -5 to -7 (the absolute values are the same).

This is what they are doing here: instead of maximizing the objective directly, they minimize the negative of the objective. Hope you get the point; if not, feel free to post your queries.
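To make this concrete, here is a minimal sketch (with made-up critic scores, not the assignment's actual data) showing that the quantity the notebook minimizes is exactly the negative of the distance the critic wants to maximize:

```python
import torch

# Hypothetical critic scores on a batch of real and fake images
crit_real_pred = torch.tensor([2.0, 3.0, 1.5])
crit_fake_pred = torch.tensor([-1.0, 0.5, -0.5])

# Distance the critic wants to MAXIMIZE (lecture's view, ignoring the penalty):
distance = torch.mean(crit_real_pred) - torch.mean(crit_fake_pred)

# What the notebook MINIMIZES (again ignoring the penalty):
neg_distance = torch.mean(crit_fake_pred) - torch.mean(crit_real_pred)

print(distance.item(), neg_distance.item())  # one is the negative of the other
```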

Regards,

Nithin

Hi Nithin,

Thanks a lot for your explanation. The problem I am facing with this equation is: if they are actually using the negative of the loss function, shouldn't the penalty term also be negative?

It's positive in both equations.

Please clarify this.

Thanks again.

Good question.

The loss function is min_g max_c [E[c(x)] - E[c(g(z))]] + penalty. The penalty is added to the loss function separately (it has to be minimized), and the min-max part is broken down as I said before.

So the penalty retains the same sign, since it is simply added on top:

loss = original_loss + penalty, where original_loss = -(distance)

In the lecture's equation, the loss is the whole min-max objective plus the penalty; in the implementation, loss = -(distance) + penalty. Hope this helps.
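As a side note, the penalty term E[(||grad(c(x_hat))||_2 - 1)^2] from the formula is typically computed along these lines. This is a sketch under my own naming, not the assignment's exact code; the toy critic at the bottom is just there to show the function runs:

```python
import torch
import torch.nn as nn

def gradient_penalty(critic, real, fake):
    # x_hat: a random interpolation between real and fake images
    eps = torch.rand(real.size(0), 1, 1, 1)
    x_hat = (eps * real + (1 - eps) * fake).requires_grad_(True)
    scores = critic(x_hat)
    # Gradient of the critic's scores with respect to x_hat
    grad = torch.autograd.grad(
        outputs=scores,
        inputs=x_hat,
        grad_outputs=torch.ones_like(scores),
        create_graph=True,  # so the penalty itself can be back-propagated
    )[0]
    # L2 norm of the gradient per image in the batch
    grad_norm = grad.view(grad.size(0), -1).norm(2, dim=1)
    # Penalize deviation of the gradient norm from 1
    return torch.mean((grad_norm - 1) ** 2)

# Quick check with a toy linear critic on 4 fake "images" of shape 1x8x8:
critic = nn.Sequential(nn.Flatten(), nn.Linear(64, 1))
real = torch.randn(4, 1, 8, 8)
fake = torch.randn(4, 1, 8, 8)
gp = gradient_penalty(critic, real, fake)
```

Because the penalty is a squared deviation, it is non-negative either way, and it always gets added to the minimized loss.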


Ohh got it!! Thanks a lot for the clarification.

If you use the formula as mentioned in the lectures, you must set lambda to a negative value. But if you use the formula as in the notebook, you set lambda to a positive value.
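A quick numeric check of this lambda-sign equivalence for the critic's loss (the score and penalty values are hypothetical):

```python
import torch

crit_real_pred = torch.tensor([2.0, 3.0])
crit_fake_pred = torch.tensor([-1.0, 0.5])
gp = torch.tensor(0.8)  # hypothetical gradient-penalty value
c_lambda = 10.0

# Notebook form, minimized directly (positive lambda):
notebook_loss = (torch.mean(crit_fake_pred) - torch.mean(crit_real_pred)
                 + c_lambda * gp)

# Lecture form, which the critic maximizes, with a NEGATIVE lambda:
lecture_objective = (torch.mean(crit_real_pred) - torch.mean(crit_fake_pred)
                     + (-c_lambda) * gp)

# Maximizing the lecture objective = minimizing its negative,
# which is exactly the notebook loss:
print(notebook_loss.item(), lecture_objective.item())
```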