Hi,
While I completed the programming assignment for Week 1, I have some lingering questions about a few aspects. Your help is greatly appreciated!
-
Since the discriminator’s output has only 1 neuron/node, I just want to confirm the discriminator is doing a binary classification/logistical regression of whether an image is merely a digit BUT without further classifying the image is a “0”, “1”, … “9”. This is consistent with the example of generating an image of a cat but the noise vector is the “seed” for what type/breed of cat will be generated
-
In the optional part of the programming assignment, I trained the GAN for 90K+ steps and got the following images from the generator. Are they similar to what others have gotten from using the default setup for batch size and # epochs? I am trying to figure out if these images are within expectations
-
Regarding the comment on one would want the generator and the discriminator to be always similar in capabilities, the top of the image below also captured the generator’s loss ended up being ~2.5x of the discriminator’s loss. Is 2.5x considered as a sizable imbalance between the 2 loss metrics? If so, has anyone considered active monitoring the imbalance between the 2 loss metrics during training, and when an imbalance arises dynamically invoke more training for the block with higher loss until the imbalance is gone? I admit this will make the training non-deterministic
-
For training the generator, instead of using deterministic #'s for batch size and # epochs, has anyone considered simply keep training until the loss drops below a threshold? Probably still need to OR this with # epochs but the # epochs is set to a very high value as a safety net