GAN Course 1 - Week 1 assignment: Why is batch normalization not used in the discriminator block?

Batch normalization is used in the generator block, and its use there is intuitive. As per my
understanding, it serves as a remedy for covariate shift and internal covariate shift. Along similar lines, I would expect batch normalization to be used in the discriminator block as well. The discriminator is a neural network with 3 hidden layers, so internal covariate shift is a possibility there too. Requesting you to elucidate the reasoning behind not using batch normalization in the discriminator network.
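For reference, here is a minimal sketch of the two kinds of blocks being contrasted, assuming a PyTorch setup similar to the assignment's; the layer sizes and helper names are illustrative, not the actual course code:

```python
import torch.nn as nn

def gen_block(in_dim, out_dim):
    # Generator block: Linear -> BatchNorm -> ReLU.
    # BatchNorm re-centers and re-scales the layer's activations,
    # which is the usual remedy for internal covariate shift.
    return nn.Sequential(
        nn.Linear(in_dim, out_dim),
        nn.BatchNorm1d(out_dim),
        nn.ReLU(inplace=True),
    )

def disc_block(in_dim, out_dim):
    # Discriminator block: Linear -> LeakyReLU, deliberately no BatchNorm.
    return nn.Sequential(
        nn.Linear(in_dim, out_dim),
        nn.LeakyReLU(0.2, inplace=True),
    )
```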


I suppose the reason for this is that if you apply BN in the discriminator to both real and fake images, the learned BN statistics will aggregate both distributions. That could break or slow down training: fake images contain no useful information at the beginning of training, and their distribution changes over time, so it becomes challenging for the discriminator to train.
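As a toy illustration of that point (using made-up activation statistics, not course code): when a batch mixes real and fake activations drawn from different distributions, the batch-norm mean and variance land somewhere between the two, so neither group is actually standardized, and the statistics keep shifting as the generator evolves.

```python
import torch

torch.manual_seed(0)

# Hypothetical discriminator activations: real images cluster around one
# distribution, early (untrained) fakes around a different, noisier one.
real_acts = torch.randn(64, 16) * 1.0 + 2.0   # mean ~ 2
fake_acts = torch.randn(64, 16) * 3.0 - 4.0   # mean ~ -4, and drifting as G trains

# One mixed batch, as seen by a BN layer inside the discriminator.
batch = torch.cat([real_acts, fake_acts])
mean, std = batch.mean(dim=0), batch.std(dim=0)

# Normalizing with the pooled statistics centers neither group: the real
# activations stay offset positive, the fakes offset negative.
normalized = (batch - mean) / (std + 1e-5)
print("real mean after BN:", normalized[:64].mean().item())
print("fake mean after BN:", normalized[64:].mean().item())
```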
