Improvement of assignment write-up

The assignment asks us to stop training when the MSE loss reaches 320 or below.
When I did this, the grader failed my submission due to insufficient structural similarity.

Later, when the `vae.losses` terms were also factored in, the submission passed.

If you could revise the write-up to mention that the overall reconstruction loss (not just the MSE) should be < 320, that'd be great.
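
For anyone else hitting this: in a Keras-style VAE with a custom training loop, the KL divergence is usually registered inside the model via `add_loss()` and shows up in `model.losses`. A minimal sketch of what the combined loss looks like under that setup (`vae`, `x`, and `train_step` are placeholder names, not the assignment's actual code):

```python
import tensorflow as tf

mse = tf.keras.losses.MeanSquaredError()
optimizer = tf.keras.optimizers.Adam()

def train_step(vae, x):
    """One training step; `vae` and `x` are placeholder names."""
    with tf.GradientTape() as tape:
        reconstructed = vae(x)
        reconstruction_loss = mse(x, reconstructed)
        # KL divergence terms registered inside the model via add_loss()
        # surface in vae.losses; the "< 320" target applies to the sum,
        # not to the MSE term alone.
        total_loss = reconstruction_loss + tf.reduce_sum(vae.losses)
    grads = tape.gradient(total_loss, vae.trainable_weights)
    optimizer.apply_gradients(zip(grads, vae.trainable_weights))
    return total_loss
```

The passing threshold applies to `total_loss`, not to the MSE term alone.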

After training until the loss was < 310, the structural similarity was 0.66. I'm not sure whether the tests have been revamped with respect to the assignment description, but below 310 could be a good point to stop training.

Cheers.


Following up on this comment.

I tried 200 epochs and the loss ended up around 180, but the reconstructed images were really far off from the training data. It makes sense that the submitted model failed the grader.

After a few attempts, I noticed that the model's output looks best when the training loss is around 310. I manually stopped the run, submitted the model, and the grader passed it this time.

My question is: why would a lower training loss end up producing a worse model? Is the proper training method to monitor the sample output images and stop when they look good?
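
For what it's worth, a pixel-wise loss doesn't always track perceptual quality, so eyeballing samples each epoch is a reasonable complement to watching the loss. A rough sketch of such a check, assuming a Keras-style model (`vae` and `sample_batch` are placeholder names):

```python
import matplotlib.pyplot as plt

def show_reconstructions(vae, sample_batch, epoch, n=8):
    """Plot originals vs. reconstructions for a quick visual check."""
    reconstructed = vae.predict(sample_batch[:n])
    fig, axes = plt.subplots(2, n, figsize=(2 * n, 4))
    for i in range(n):
        axes[0, i].imshow(sample_batch[i])
        axes[1, i].imshow(reconstructed[i])
        axes[0, i].axis("off")
        axes[1, i].axis("off")
    fig.suptitle(f"Epoch {epoch}: originals (top) vs. reconstructions (bottom)")
    plt.show()
```

Calling this at the end of each epoch makes it easy to stop the loop manually once the reconstructions look right.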

Thanks,
Yusa


Hello Balaji, can I ask how many epochs you used for this assignment? I am doing the same assignment.

I don’t remember. Adding @gent.spah


Balaji, I have a doubt. First, when I ran training for 5 epochs, it didn't reach the required loss, so I increased the epochs to 100. This time training ran fine until epoch 98, where the MSE was 487, but at epoch 99 the MSE turned to NaN and the training loop stopped. After a while I tried again with 50 epochs, running the notebook from the beginning; this time it ran properly and I was about to reach the desired MSE around epoch 40, but my power got cut off and I had to train the model again. On the retry I again set it to run for 40 epochs, but this time the MSE started at 10,000, whereas in earlier runs it had begun around 1,500 or 1,100.

Can you tell me the reason for this fluctuation? I am looking for a solution to this problem.
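
On the NaN at epoch 99: a loss that suddenly turns NaN late in training is often a sign of exploding gradients. Gradient clipping is one common guard; a minimal sketch, assuming TensorFlow/Keras (the learning-rate and clip values are just examples):

```python
import tensorflow as tf

# Capping the global gradient norm keeps one bad batch from blowing the
# weights (and hence the loss) up to NaN; 1e-3 and 1.0 are example values.
optimizer = tf.keras.optimizers.Adam(learning_rate=1e-3, clipnorm=1.0)
```

Lowering the learning rate has a similar stabilizing effect.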

My doubts are as follows:

  1. Why does the MSE differ every time I start a training loop, even though the code is the same and I didn't change anything? (See the seeding sketch after this list.)

  2. I also noticed that as the epochs go higher, the loss decreases more and more slowly. So does that mean keeping the epochs fewer and the learning rate higher is a better way to train a model?

  3. Since I understand this is a variational model, is that the reason for the fluctuation during training?
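
On doubt 1: unless the random seeds are pinned, each run starts from different initial weights and a different data shuffle order, so the starting MSE will differ even with identical code. A minimal sketch of pinning the seeds, assuming TensorFlow (the seed value is arbitrary):

```python
import os
import random

import numpy as np
import tensorflow as tf

SEED = 42  # arbitrary example value

# Pin every source of randomness so that runs start from the same
# initial weights and shuffle order and are directly comparable.
os.environ["PYTHONHASHSEED"] = str(SEED)
random.seed(SEED)
np.random.seed(SEED)
tf.random.set_seed(SEED)
```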

@balaji.ambresh @paulinpaloalto

Thank you in advance
DP

Hello @Deepti_Prasad, the number of epochs should be similar to the ungraded lab, but perhaps even fewer would do the job. You could try progressively, based on your intuition.
