@TRAN_KHANH1, to add to @gautamaltman's comments:

I can think of one reason you might choose `detach()`. Since `detach()` creates a new tensor, if you use `fake.detach()` for the discriminator, you can theoretically still use the same `fake` with your generator, where you *do* want the gradients. I think some of the assignments take advantage of this.

I suspect the course developers used `detach()` in all the assignments for consistency, to help students focus on the main concepts. But you're absolutely right that as long as you don't need or want to reuse `fake`, the `requires_grad_` approach is more efficient.
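To illustrate the reuse pattern, here's a minimal sketch with toy linear layers standing in for a real generator and discriminator (the models, shapes, and optimizers are just placeholders, not the assignment code):

```python
import torch
import torch.nn as nn

torch.manual_seed(0)

gen = nn.Linear(4, 4)    # stand-in generator
disc = nn.Linear(4, 1)   # stand-in discriminator
criterion = nn.BCEWithLogitsLoss()

disc_opt = torch.optim.SGD(disc.parameters(), lr=0.1)
gen_opt = torch.optim.SGD(gen.parameters(), lr=0.1)

noise = torch.randn(8, 4)
fake = gen(noise)  # built once; still attached to the generator's graph

# Discriminator step: detach() blocks gradients from flowing into gen.
disc_opt.zero_grad()
disc_loss = criterion(disc(fake.detach()), torch.zeros(8, 1))
disc_loss.backward()
disc_opt.step()
assert gen.weight.grad is None  # detach() kept the generator untouched

# Generator step: the SAME fake is reused; gradients now reach gen.
gen_opt.zero_grad()
gen_loss = criterion(disc(fake), torch.ones(8, 1))
gen_loss.backward()
gen_opt.step()
assert gen.weight.grad is not None  # gradients flowed through fake
```

If `fake` were generated under `torch.no_grad()` (or with gradients disabled via `requires_grad_`), the generator step would need a second forward pass through `gen` to rebuild the graph, which is the trade-off being discussed.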