Week 2 - Clarity on DCGAN

I’m on the week 2 assignment (C1_W2), which is building my first DCGAN, so I need clarity on a few things, please.

  1. I’d like some more light shed on why we had to use a deconvolution (nn.ConvTranspose2d) for the generator and nn.Conv2d for the discriminator.

  2. I would like to know why 0.5 was passed into the normalization method. What effect would it have on the result if other values were used?

  3. The weight_init function we created got me lost. Why do we use it?
    Where did those values come from? I would be so glad if anyone could break this down for me.

This was covered in the lectures, wasn’t it? The fundamental point is that the generator needs to expand the dimensions of the data. In most cases, we start from a one-dimensional “noise” vector of some size and we want the generator to produce a much larger object like an image. A transpose convolution is the “inverse” of a convolution: it generates a larger output if properly configured. So the usual technique is to cascade a number of transpose convolutions to create the output, frequently with other layers like activations included.
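For a concrete picture, here is a minimal PyTorch sketch of a generator built from cascaded transpose convolutions. The channel counts, kernel sizes, and output resolution are purely illustrative, not the assignment’s exact configuration:

```python
import torch
import torch.nn as nn

# Sketch of a DCGAN-style generator: each ConvTranspose2d upsamples the spatial
# dimensions, turning a 1-D noise vector into an image-sized tensor.
gen = nn.Sequential(
    # noise (64, 1, 1) -> (256, 4, 4)
    nn.ConvTranspose2d(64, 256, kernel_size=4, stride=1, padding=0),
    nn.BatchNorm2d(256),
    nn.ReLU(inplace=True),
    # (256, 4, 4) -> (128, 8, 8)
    nn.ConvTranspose2d(256, 128, kernel_size=4, stride=2, padding=1),
    nn.BatchNorm2d(128),
    nn.ReLU(inplace=True),
    # (128, 8, 8) -> (1, 16, 16), tanh so pixel values land in [-1, 1]
    nn.ConvTranspose2d(128, 1, kernel_size=4, stride=2, padding=1),
    nn.Tanh(),
)

noise = torch.randn(8, 64, 1, 1)   # batch of 8 noise vectors
print(gen(noise).shape)            # torch.Size([8, 1, 16, 16])
```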

For the discriminator, on the other hand, we are building a binary classifier: it takes a large object like an image and turns that into a single-bit “fake/not fake” classification output. A normal convolution reduces the dimensions from input to output if properly configured. We cascade a number of conv layers with activations together and then train them to produce the desired “yes/no” output. We see convnets of that style used as classifiers all the time, e.g. in DLS Course 4.
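And the mirror image for the discriminator, again with illustrative sizes only:

```python
import torch
import torch.nn as nn

# Sketch of a DCGAN-style discriminator: each Conv2d shrinks the spatial
# dimensions, ending in a single real/fake score per image.
disc = nn.Sequential(
    # (1, 16, 16) -> (128, 8, 8)
    nn.Conv2d(1, 128, kernel_size=4, stride=2, padding=1),
    nn.LeakyReLU(0.2, inplace=True),
    # (128, 8, 8) -> (256, 4, 4)
    nn.Conv2d(128, 256, kernel_size=4, stride=2, padding=1),
    nn.BatchNorm2d(256),
    nn.LeakyReLU(0.2, inplace=True),
    # (256, 4, 4) -> (1, 1, 1): one logit per image
    nn.Conv2d(256, 1, kernel_size=4, stride=1, padding=0),
)

images = torch.randn(8, 1, 16, 16)
print(disc(images).view(-1).shape)  # torch.Size([8]) -- one score per image
```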

In neural networks of any kind, we need to initialize the weights randomly in order to implement “symmetry breaking”. If you start with all the weights the same, then when you run the training all the neurons will learn the same thing, which is not useful. That initialization routine is a pretty standard one that uses a normal distribution with \mu = 0 and \sigma set to a small value.
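Here is a sketch of an initializer in that style. The 0.02 standard deviation is the value used in the original DCGAN paper; I’m assuming the assignment uses something similar, so treat the exact numbers as illustrative:

```python
from torch import nn

def weight_init(m):
    # DCGAN-style initialization: draw conv / transpose-conv weights from a
    # normal distribution with mean 0 and a small std (0.02, per the DCGAN paper).
    # The random draws break symmetry so each filter can learn something different.
    if isinstance(m, (nn.Conv2d, nn.ConvTranspose2d)):
        nn.init.normal_(m.weight, mean=0.0, std=0.02)
    elif isinstance(m, nn.BatchNorm2d):
        nn.init.normal_(m.weight, mean=0.0, std=0.02)
        nn.init.constant_(m.bias, 0)

# .apply() walks every submodule, so one call initializes the whole network:
# gen = gen.apply(weight_init)
```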

Well, you are welcome to try different values and see how it affects the results. Here’s the docpage for the torchvision Normalize transform. I confess that the comment there does not map to my understanding of the actual code that they gave you. Having \mu = 0.5 and \sigma = 0.5 will not result in only values between -1 and 1, at least according to my understanding.


Thank you so much, that was helpful.

I dug into the PyTorch Normalize transform to understand how the code comment relates to the parameters passed in:

In the documentation, it says that the transform normalizes each channel using this function:

output[channel] = (input[channel] - mean[channel]) / std[channel]

So, the parameters are not the actual mean and std of the data; they are adjustments applied to the input data to shift and scale those statistics.

In particular, in the assignment, the input MNIST dataset has values ranging from 0.0 to 1.0, so:

  • with a “mean” parameter of 0.5, (input[channel] - mean[channel]) gives us values ranging from -0.5 to 0.5, shifting the mean to the left by 0.5.
  • then, dividing that value by std[channel] with a “std” parameter of 0.5, we stretch out the range to get values from -1.0 to 1.0.

This is why the comments talk about normalizing the input values to fit the range -1.0 to 1.0.
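A quick way to check this is to run the transform on data in the MNIST range. This is just an illustrative snippet, not code from the assignment:

```python
import torch
from torchvision import transforms

# MNIST pixels arrive as values in [0.0, 1.0]; Normalize(0.5, 0.5) computes
# (x - 0.5) / 0.5, which shifts and stretches them into [-1.0, 1.0].
normalize = transforms.Normalize(mean=(0.5,), std=(0.5,))

fake_mnist = torch.rand(1, 28, 28)   # stand-in for a 1-channel MNIST image
out = normalize(fake_mnist)

print(fake_mnist.min().item(), fake_mnist.max().item())  # roughly 0.0 .. 1.0
print(out.min().item(), out.max().item())                # roughly -1.0 .. 1.0
```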

Thanks, Wendy! That makes total sense now. I was just thinking about “normal” distributions in general and hadn’t taken the step of going back and looking at the notebook to remind myself of what the inputs actually look like here.