In the explanatory text of this Lab, it is stated that the encoder outputs the mu (mean) and sigma (standard deviation) of the latent representation. In fact, it outputs the natural logarithm of the variance, not the standard deviation: what the code calls sigma is actually log(sigma**2) in the standard notation where sigma denotes the standard deviation. This is immediately obvious from both the computation in the Sampling class and the KL-divergence term in the loss function kl_reconstruction_loss().
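To make the point concrete, here is a minimal NumPy sketch of the standard VAE pattern I am referring to (the Lab's actual Keras code may differ in details, but the arithmetic is the same): the sampling step computes the standard deviation as exp(0.5 * log_var), and the KL term uses log_var directly as log(sigma**2).

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_z(mu, log_var):
    # Reparameterization trick: z = mu + sigma * epsilon, epsilon ~ N(0, 1).
    # sigma = exp(0.5 * log_var), i.e. the encoder output is log(sigma**2),
    # not sigma itself.
    eps = rng.standard_normal(mu.shape)
    return mu + np.exp(0.5 * log_var) * eps

def kl_divergence(mu, log_var):
    # KL(N(mu, sigma^2) || N(0, 1)), summed over the latent dimensions:
    # -0.5 * sum(1 + log(sigma^2) - mu^2 - sigma^2).
    # Note that log_var appears both directly and inside exp(.), which only
    # makes sense if it is log(sigma**2).
    return -0.5 * np.sum(1.0 + log_var - mu**2 - np.exp(log_var), axis=-1)

# With mu = 0 and log_var = 0 (i.e. sigma = 1), the posterior equals the
# standard-normal prior, so the KL divergence is exactly zero.
mu = np.zeros((1, 2))
log_var = np.zeros((1, 2))
print(kl_divergence(mu, log_var))  # [0.]
```

If the encoder output were interpreted as sigma instead, both formulas above would be wrong, which is why I think the naming in the Lab text should be corrected.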
It would be great if this could be corrected in order to prevent possible confusion for learners.
I also have a question about this: is there any advantage to having the encoder predict the log variance rather than the standard deviation? If so, a sentence or two about this in the Lab, or at least a relevant link, would be helpful.
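My own guess (not confirmed by the Lab material) is that it is a matter of convenience and numerical stability: a dense layer can emit any real number, and exponentiating a log-variance is always strictly positive, so no extra activation or clamping is needed, whereas a head predicting sigma directly would need something like softplus to stay positive. A small illustration:

```python
import numpy as np

# An unconstrained dense head can emit any real value.
raw = np.array([-5.0, 0.0, 5.0])

# Interpreting the output as log(sigma**2): the implied variance is
# strictly positive for any real input, with no activation needed, and it
# spans many orders of magnitude smoothly.
variance = np.exp(raw)

# Interpreting the output directly as sigma would require a positivity
# constraint, e.g. softplus, to avoid invalid negative scales.
sigma = np.log1p(np.exp(raw))  # softplus(raw)

print(variance)  # all strictly positive
print(sigma)     # all strictly positive as well, but needed an extra transform
```

Again, this is just my reasoning for why the log-variance parameterization is common; an authoritative sentence or reference in the Lab would settle it.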