Update: I found that @Elemento actually answered my question, because @Xiaojian_Deng made the same mistake I did. @Elemento’s answer is here. The gist of his answer is:
> Now, though we want to call detach on the fake images when we are updating the discriminator (since we don’t want to update the generator in this case), we don’t want the same thing to happen when we are updating the generator. And hence, when you changed the position of detach method, the generator didn’t update at all, and hence, led to empty squares.
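To make that concrete, here is a minimal sketch of where the detach belongs in the two update steps. This is my paraphrase, not the actual course code; `gen`, `disc`, `criterion`, `noise`, `real`, `disc_opt`, and `gen_opt` are placeholder names:

```python
import torch

# Sketch only: gen, disc, criterion, noise, real, disc_opt, gen_opt
# are placeholders, not the actual lab objects.

# --- Discriminator update: detach the fakes so no gradient reaches gen ---
fake = gen(noise)
disc_fake_pred = disc(fake.detach())               # detach HERE
disc_real_pred = disc(real)
disc_loss = (criterion(disc_fake_pred, torch.zeros_like(disc_fake_pred))
             + criterion(disc_real_pred, torch.ones_like(disc_real_pred))) / 2
disc_opt.zero_grad()
disc_loss.backward()
disc_opt.step()

# --- Generator update: NO detach, so gradients flow back through disc ---
fake = gen(noise)
disc_fake_pred = disc(fake)                        # no detach here
gen_loss = criterion(disc_fake_pred, torch.ones_like(disc_fake_pred))
gen_opt.zero_grad()
gen_loss.backward()
gen_opt.step()
```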
Thank you, @gent.spah, for the checklist! I found the problem, and I believe it is related to your checklist #1. The problem was that I had placed the “.detach()” in the wrong place:
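I won’t reproduce my exact cell here, but the misplacement amounted to something like this (hypothetical reconstruction, not my actual code):

```python
# Hypothetical reconstruction of the mistake, not my exact code:
fake = gen(noise).detach()        # detached at generation time -- too early

# The discriminator update still behaves (it wanted detached fakes anyway),
# but the generator update below now computes a loss whose graph never
# reaches gen's parameters, so gen_loss.backward() gives the generator no
# gradients and gen_opt.step() changes nothing -> the generator never
# learns, and the samples stay as empty squares.
disc_fake_pred = disc(fake)
gen_loss = criterion(disc_fake_pred, torch.ones_like(disc_fake_pred))
```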
This shows my ignorance of how to use the `.detach()` method. The `detach()` docs say:

> Returns a new Tensor, detached from the current graph.
> The result will never require gradient.

but the docs don’t show examples of where it should be placed.
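The behavior itself is easy to see in a REPL (standard PyTorch, not lab code):

```python
import torch

x = torch.randn(3, requires_grad=True)
y = x.detach()
print(y.requires_grad)   # False: y will never require gradient
print(y.grad_fn)         # None: y is cut out of the autograd graph
```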
For where to place the `.detach()`, in the Week 4 lab I was following the pattern of the Week 1 lab:
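As I remember it, the Week 1 discriminator step detaches the fakes right at the discriminator’s input, roughly like this (paraphrased, not the exact cell):

```python
# Week 1 lab pattern, from memory:
fake = gen(fake_noise)
disc_fake_pred = disc(fake.detach())   # detach only inside the disc update
disc_fake_loss = criterion(disc_fake_pred, torch.zeros_like(disc_fake_pred))
```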
Why is the Week 4 lab different, and why does placing the `.detach()` in the wrong location there cause training to fail?
BTW, the conversation “Why should we detach the discriminators input ?!” is very relevant, but after reading it through I don’t think it answers my question of where to put the `.detach()` method.
Thanks,
Steve