For assignment C2W2A1, if we increase the number of epochs, the cost vs. iteration plot shows peaks at regular intervals, with the descent continuing afterwards. What is the reason behind this, and is there any way to optimize it?
Sorry, for some reason I no longer have access to that lab, so I can’t run it to investigate this issue.
Hi,
If I restart the kernel after each change to the number of epochs, everything goes smoothly. Without restarting the kernel, the result does indeed look erroneous.
Steps:
- restart the kernel after the first run of `plot_loss_tf(history)`
- set `epochs = 200`
- run all cells
- result: OK
- do not restart the kernel
- set `epochs = 100`
- rerun `plot_loss_tf(history)`
- result: not OK
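The behavior those steps expose can be reproduced outside the notebook with a small made-up example (the data and single-layer model here are assumptions for illustration, not the lab's actual network): calling `fit` a second time on the same model object resumes training instead of restarting it.

```python
import numpy as np
import tensorflow as tf

# Tiny synthetic regression problem (made up; the lab's data differs).
X = np.linspace(-1, 1, 64).reshape(-1, 1).astype("float32")
y = 3 * X + 1

model = tf.keras.Sequential([tf.keras.layers.Dense(1)])
model.compile(optimizer="sgd", loss="mse")

# First training run: loss descends from the randomly initialized weights.
h1 = model.fit(X, y, epochs=100, verbose=0)

# Changing `epochs` and calling fit again does NOT restart training:
# the weights keep their trained values, so the second run's loss
# starts near where the first run ended, not back at the top.
h2 = model.fit(X, y, epochs=100, verbose=0)

print(h1.history["loss"][0], h2.history["loss"][0])
```

Plotting only `h2`'s history after such a rerun would therefore show a curve that starts much lower than expected, which is consistent with the "result: not OK" step above.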
In general, the notebooks contain a lot of global variables, and if you just change some aspect of the model configuration and then train it again, the workspace is not always fully reset.
FYI, that’s a big cause of weird behavior in some of the other courses (like the Deep Learning Specialization).
But that is a strange bit of behavior that might be worth exploring further. Perhaps it has to do with exactly what `plot_loss_tf()` is doing.
Indeed, if we rerun the cell containing `model = Sequential`, everything is OK too. So the `keras.models.Sequential` object needs to be re-instantiated. Note: without re-instantiating it, the loss keeps going down from where it stopped previously, as if the total were `epochs += new_num_epochs`.
It’s because the weight values are initialized only when the model object is created, so any later call to `fit` continues from the already-trained weights rather than from a fresh initialization.
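That mechanism can be sketched without Keras at all. The toy class below (entirely made up, not Keras internals) initializes its weight only in the constructor, just as a Keras model's weights are created when the model object is built; re-running `fit` resumes, while re-instantiating resets.

```python
class ToyModel:
    def __init__(self):
        # The weight is initialized ONLY here, at instantiation time.
        self.w = 10.0

    def fit(self, epochs):
        """Gradient descent on loss = w**2; returns the loss history."""
        history = []
        for _ in range(epochs):
            grad = 2 * self.w        # d(w^2)/dw
            self.w -= 0.1 * grad     # one update step
            history.append(self.w ** 2)
        return history

model = ToyModel()
first = model.fit(epochs=100)   # loss descends toward 0

# Re-running fit WITHOUT re-creating the model: the loss resumes from
# where it stopped, because self.w still holds the trained value.
second = model.fit(epochs=100)

# Re-instantiating the model resets the weight, so training restarts
# from the same initial loss as the very first run.
model = ToyModel()
fresh = model.fit(epochs=100)
```

Here `second[0]` is below `first[-1]` (training continued), while `fresh[0]` equals `first[0]` (training restarted), mirroring what re-running the `model = Sequential` cell does in the notebook.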