When training the generator, .detach() is not called on the discriminator: the generator's cost is, by definition, computed from the discriminator's output, so gradients must flow back through the discriminator (see Detach() used in Assignment 4 - #4 by paulinpaloalto). Now, if we don't call disc_opt.zero_grad() before the generator training step (i.e. after the discriminator training step), won't the gradients from the previous discriminator update accumulate in the discriminator's parameters while we train the generator?
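
For concreteness, here is a minimal sketch of the loop structure I am asking about. The names (gen, disc, gen_opt, disc_opt, criterion, real, noise) and the tiny linear stand-in models are placeholders I made up, not the assignment's actual code:

```python
import torch
from torch import nn

# Placeholder models/optimizers, not the assignment's real architectures.
gen = nn.Linear(10, 784)           # "generator": noise -> fake image
disc = nn.Linear(784, 1)           # "discriminator": image -> real/fake logit
gen_opt = torch.optim.Adam(gen.parameters(), lr=2e-4)
disc_opt = torch.optim.Adam(disc.parameters(), lr=2e-4)
criterion = nn.BCEWithLogitsLoss()

real = torch.randn(16, 784)        # placeholder batch of "real" images
noise = torch.randn(16, 10)

# --- Discriminator step ---
disc_opt.zero_grad()                                  # clear old disc gradients
fake = gen(noise)
disc_fake_pred = disc(fake.detach())                  # detach: no grads flow into gen here
disc_real_pred = disc(real)
disc_loss = (criterion(disc_fake_pred, torch.zeros_like(disc_fake_pred))
             + criterion(disc_real_pred, torch.ones_like(disc_real_pred))) / 2
disc_loss.backward()                                  # fills disc parameters' .grad
disc_opt.step()                                       # step() does NOT clear .grad

# --- Generator step (no detach, and no disc_opt.zero_grad() here) ---
gen_opt.zero_grad()                                   # clear old gen gradients
disc_fake_pred = disc(gen(noise))                     # grads must flow through disc into gen
gen_loss = criterion(disc_fake_pred, torch.ones_like(disc_fake_pred))
gen_loss.backward()                                   # this also adds to disc parameters' .grad
gen_opt.step()                                        # but only gen's parameters are updated
```

My question is about the gen_loss.backward() call above: since the discriminator is not detached there, it writes gradients into the discriminator's .grad buffers on top of whatever was left from the earlier disc_loss.backward().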