I did not quite understand the architecture of the generator mentioned in the notebook. (image in notebook)
I also did not quite understand this statement about the generator:
“You may notice that instead of passing in the image dimension, you will pass the number of image channels to the generator. This is because with DCGAN, you use convolutions which don’t depend on the number of pixels on an image. However, the number of channels is important to determine the size of the filters.”
I did understand that to get an image out of a noise vector we use deconvolution (to increase the size of the image from the low-dimensional noise).
But I did not quite understand how we get it just by passing the number of channels.
Please consider that the size of the filters is not determined by the image dimensions (pixels), but by the number of channels. In summary, the channel counts are what set the proper size of the filters, where each channel corresponds to a particular aspect or feature of the input data.
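If it helps to see this concretely, here's a minimal PyTorch sketch (the assignment uses PyTorch) showing that a transposed-convolution layer's learned weights are shaped only by its channel counts and kernel size, so the same layer works on inputs of any spatial size. The specific channel/kernel numbers here are just illustrative:

```python
import torch
from torch import nn

# A transposed-conv layer is parameterized by channel counts and kernel
# size only -- nothing in it references the image's height or width.
layer = nn.ConvTranspose2d(in_channels=64, out_channels=32,
                           kernel_size=4, stride=2, padding=1)
print(layer.weight.shape)  # torch.Size([64, 32, 4, 4])

# The very same layer upsamples inputs of different spatial sizes:
print(layer(torch.randn(1, 64, 4, 4)).shape)    # torch.Size([1, 32, 8, 8])
print(layer(torch.randn(1, 64, 16, 16)).shape)  # torch.Size([1, 32, 32, 32])
```

That's why the generator takes the number of image channels rather than the image dimension: the pixel count never appears in any layer's parameters.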
The main thing to keep in mind is that this is a diagram from a paper that was using different values in its model, so it’s best to think of it as just giving the general idea of what’s going on, rather than any specifics. For example:
1) The diagram shows the generator starting with a noise vector and going through several transposed-convolution blocks to finally create an image-sized output with the appropriate number of channels. (In the case of the diagram, the final output is a 64x64 image with 3 channels. In the assignment, the final output is a 28x28 image with one channel, since our assignment is for black-and-white images.)
2) The diagram shows the general pattern of the “image” size increasing with each block while the number of channels decreases, with the image going from 4x4 -> 8x8 -> 16x16 -> 32x32 -> 64x64 and the channels going from 1024 -> 512 -> … -> 3. Similarly, with each block in our assignment, the “image” size increases while the channels decrease until we get to a 28x28 image with one channel. (See the sketch below for this pattern in code.)
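Here's a rough sketch of that pattern in PyTorch, following the paper diagram's numbers (64x64, 3 channels). It's meant to show the shape progression, not the assignment's exact architecture, and the block structure (transposed conv + BatchNorm + ReLU, with Tanh on the last block) is the usual DCGAN convention:

```python
import torch
from torch import nn

def gen_block(in_ch, out_ch, kernel=4, stride=2, padding=1, final=False):
    """One generator block: upsample with a transposed conv, then
    BatchNorm + ReLU (or Tanh for the final, image-producing block)."""
    if final:
        return nn.Sequential(
            nn.ConvTranspose2d(in_ch, out_ch, kernel, stride, padding),
            nn.Tanh(),
        )
    return nn.Sequential(
        nn.ConvTranspose2d(in_ch, out_ch, kernel, stride, padding),
        nn.BatchNorm2d(out_ch),
        nn.ReLU(inplace=True),
    )

# Illustrative channel counts following the diagram's pattern:
# the "image" size doubles per block while the channels shrink.
generator = nn.Sequential(
    gen_block(100, 1024, kernel=4, stride=1, padding=0),  # 1x1  -> 4x4
    gen_block(1024, 512),                                 # 4x4  -> 8x8
    gen_block(512, 256),                                  # 8x8  -> 16x16
    gen_block(256, 128),                                  # 16x16 -> 32x32
    gen_block(128, 3, final=True),                        # 32x32 -> 64x64
)

noise = torch.randn(1, 100, 1, 1)  # noise vector reshaped to a 1x1 "image"
print(generator(noise).shape)      # torch.Size([1, 3, 64, 64])
```

Notice that the only image-related argument anywhere is the channel count of the final block; everything else is channel counts and kernel/stride choices, which is exactly the point the notebook's statement is making.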