C4_W3_A2 -> Training Additional Epochs

In Step 4 - Train the Model, I changed EPOCHS to 40 to see how the model would improve.


I ended up with the training run shown here: the model improved for a while, then its accuracy collapsed and it had to relearn, finishing only slightly better than where it started.

I was curious what would cause such a massive drop in accuracy?

2nd attempt training with 40 Epochs:

3rd attempt training:

The cost surfaces are in very high dimensions and are incredibly complex. You can “go off a cliff” at any point in the training. It might be worth trying a different optimization algorithm that uses more sophisticated controls on learning rate and the like. Here’s a thread which talks a bit more about non-convexity and also links to a paper by Yann LeCun’s group that is worth a look for insight about whether reasonable solutions exist and can be found or not.
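One concrete way to add "more sophisticated controls on learning rate and the like" in Keras is a decaying learning-rate schedule plus gradient clipping. This is a hypothetical sketch, not the assignment's actual model or optimizer settings; the tiny model and synthetic data below are placeholders just to make the snippet runnable.

```python
import numpy as np
import tensorflow as tf

# Placeholder model standing in for the assignment's network.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(16, activation="relu", input_shape=(8,)),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])

# Shrink the learning rate over training so late updates take smaller steps.
schedule = tf.keras.optimizers.schedules.ExponentialDecay(
    initial_learning_rate=1e-3, decay_steps=100, decay_rate=0.9)

# clipnorm caps the norm of each gradient update, limiting the damage
# a single "cliff" step can do to the weights.
optimizer = tf.keras.optimizers.Adam(learning_rate=schedule, clipnorm=1.0)

model.compile(optimizer=optimizer,
              loss="binary_crossentropy",
              metrics=["accuracy"])

# Synthetic data just so the example trains end to end.
x = np.random.rand(64, 8).astype("float32")
y = (x.sum(axis=1) > 4).astype("float32")
model.fit(x, y, epochs=2, verbose=0)
```

Neither knob guarantees a smooth curve on a non-convex surface, but both tend to reduce the kind of sudden collapse shown above.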

The other point here is that there is some randomness in TF/Keras training, meaning that you don’t get the same results every time, even if you try to set random seeds and control the randomness. I’ve been meaning to research the TF docs more to see if this is controllable, but haven’t gotten around to it yet.

A useful question is:
Are those changes in accuracy even meaningful when the number of iterations is so low?
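A quick back-of-envelope check makes that question concrete. Accuracy measured on n examples is a sample proportion, so its standard error is sqrt(p(1-p)/n). The numbers below are illustrative, not from the assignment:

```python
import math

def accuracy_standard_error(p, n):
    """Standard error of an accuracy estimate p measured on n examples."""
    return math.sqrt(p * (1 - p) / n)

# With a 100-example eval set and ~80% true accuracy:
se = accuracy_standard_error(0.80, 100)  # 0.04
# Two standard errors is +/- 8 percentage points, so run-to-run accuracy
# can wander that much from sampling noise alone, with no real change
# in model quality.
```

With few epochs and a small eval set, swings of several points are well within noise.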

Here’s the TensorFlow 2.11 API for better reproducibility across the same hardware.
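For reference, the relevant calls look roughly like this (a sketch assuming TF ≥ 2.9, where both APIs exist; the seed value is arbitrary):

```python
import tensorflow as tf

# Seed Python's random module, NumPy, and TensorFlow in one call.
tf.keras.utils.set_random_seed(42)

# Force deterministic kernel implementations, so repeated runs on the
# same hardware match exactly (at some cost in speed).
tf.config.experimental.enable_op_determinism()

# Re-seeding resets the global generator, so identical calls now
# produce identical random tensors:
a = tf.random.uniform((3,))
tf.keras.utils.set_random_seed(42)
b = tf.random.uniform((3,))
# a and b are equal element-wise
```

Even with this, exact reproducibility generally only holds for the same TF version and hardware, which matches the "across the same hardware" caveat above.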