Hi @IvanK
As you have already understood, the latent code c is the part of the noise that is meant to represent meaningful features. c is also sampled randomly, alongside the incompressible noise z. The z gives the model a free hand to learn its own representation, while c constrains part of that representation to encode semantic features. The two are concatenated and passed to the generator, which is why the generator's output is written as G(z, c).
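As a minimal sketch of that input side, assuming a PyTorch generator whose first layer expects a vector of size z_dim + c_dim (the sizes and names below are illustrative, not taken from the notebook):

```python
import torch

batch_size, z_dim, c_dim = 32, 64, 10   # illustrative sizes, not the notebook's values

z = torch.randn(batch_size, z_dim)       # incompressible noise
c = torch.randn(batch_size, c_dim)       # latent code (continuous, Gaussian here)
gen_input = torch.cat([z, c], dim=1)     # concatenated input to the generator
# fake = generator(gen_input)            # generator's output G(z, c)
```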
The overall goal is to maximize the mutual information between c and G(z, c) so that, unlike the noise z in a standard GAN, the information carried by c is not lost while the generator produces fake samples. In other words, maximizing the mutual information means that c can be recovered from the fake samples, i.e. from the output G(z, c), so no information is lost. Equivalently, observing G(z, c) reduces the entropy (randomness) of the latent code, and that reduction in uncertainty is exactly the mutual information being maximized.
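For reference, this is the regularized minimax objective from the InfoGAN paper, where $\lambda$ weights the mutual-information term (in practice a variational lower bound is optimized in place of $I$):

$$\min_G \max_D \; V_I(D, G) = V(D, G) - \lambda\, I\big(c;\, G(z, c)\big)$$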
Intuitively, the latent code c ends up capturing the pertinent features (which can later be used for controllable generation) because the generator is regularized by this mutual-information term. The re-extraction of c is done by approximating the posterior distribution of c given the generated image (a Gaussian in the case of the notebook) with an auxiliary network, which provides the variational lower bound on I(c; G(z, c)) explained in the paper. The auxiliary network in the notebook predicts the mean and variance of that Normal distribution, from which c can be reconstructed.
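A minimal sketch of how such an auxiliary head and the corresponding loss could look, assuming the head takes shared discriminator features and outputs a mean and a log-variance per latent dimension (QHead and gaussian_nll are illustrative names, not the notebook's):

```python
import torch
import torch.nn as nn

class QHead(nn.Module):
    """Auxiliary head: predicts mean and log-variance of the latent code c."""
    def __init__(self, feat_dim, c_dim):
        super().__init__()
        self.mu = nn.Linear(feat_dim, c_dim)
        self.logvar = nn.Linear(feat_dim, c_dim)

    def forward(self, features):
        return self.mu(features), self.logvar(features)

def gaussian_nll(c, mu, logvar):
    """Negative log-likelihood of c under N(mu, exp(logvar)), constant term dropped.
    Minimizing it tightens the variational lower bound on I(c; G(z, c))."""
    return 0.5 * (logvar + (c - mu) ** 2 / logvar.exp()).sum(dim=1).mean()
```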
Now coming to the training steps given in the notebook (a rough sketch of the full loop in code follows the list):
Step 1. The generator takes the concatenated (z, c) as input and produces the fake output G(z, c).
Step 2. In this step, the discriminator takes G(z, c) as input. The discriminator has two heads: one predicts, as usual, whether the image is real or fake, and the other predicts the mean and variance of the latent code distribution.
Step 3. This step is similar to the previous one, except that this time the discriminator takes real images as input and computes the same outputs.
Step 4. The adversarial loss and the mutual-information loss are computed, the discriminator's backward pass is performed, and hence D's parameters are updated.
Step 5. Now it's time for the generator to learn from the discriminator's adversarial loss as well as from maximizing the mutual information between c and G(z, c). Hence, the fake samples are passed to the discriminator again; the adversarial loss together with the mutual-information term is calculated, and backpropagation is performed to update the generator's parameters.
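Putting the five steps together, a rough sketch of one training iteration might look like the following (the generator, the two-headed discriminator, the optimizers, lambda_info, and the gaussian_nll helper from the earlier sketch are assumed; this mirrors the structure of the steps rather than the exact notebook code):

```python
import torch
import torch.nn.functional as F

def train_step(generator, discriminator, d_opt, g_opt,
               real_images, z_dim, c_dim, lambda_info=0.1):
    batch_size = real_images.size(0)

    # Step 1: sample (z, c), concatenate, and generate the fake batch G(z, c)
    z = torch.randn(batch_size, z_dim)
    c = torch.randn(batch_size, c_dim)
    fake = generator(torch.cat([z, c], dim=1))

    # Step 2: discriminator on fakes -> real/fake logit plus (mu, logvar) of c
    fake_logit, mu_f, logvar_f = discriminator(fake.detach())
    # Step 3: discriminator on real images -> same outputs
    real_logit, _, _ = discriminator(real_images)

    # Step 4: adversarial loss + mutual-information loss, then update D
    d_adv = (F.binary_cross_entropy_with_logits(real_logit, torch.ones_like(real_logit))
             + F.binary_cross_entropy_with_logits(fake_logit, torch.zeros_like(fake_logit)))
    d_info = gaussian_nll(c, mu_f, logvar_f)
    d_opt.zero_grad()
    (d_adv + lambda_info * d_info).backward()
    d_opt.step()

    # Step 5: pass the fakes through D again; update G with the adversarial
    # loss plus the mutual-information term
    fake_logit, mu_f, logvar_f = discriminator(fake)
    g_adv = F.binary_cross_entropy_with_logits(fake_logit, torch.ones_like(fake_logit))
    g_info = gaussian_nll(c, mu_f, logvar_f)
    g_opt.zero_grad()
    (g_adv + lambda_info * g_info).backward()
    g_opt.step()
```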