There is something not quite right with the notebook since today’s update, which coincided with me reaching the train_step exercise.
I noticed that the new notebook did not have my earlier code, so I rewrote each step, checking the results as I went; all steps passed.
I completed train_step and all steps passed, so I saved and checkpointed.
I re-highlighted the # UNQ_C4 cell by mistake, so I reran it and then reran the next cell.
This time I got "AssertionError: Unexpected cost for epoch 0: ". I checked several more times, and each time the unexpected cost was different. Finally I reran all the cells above # UNQ_C4, even though the code for the entire notebook had been written in one go, and it passed.
Maybe it needs an instruction to run ‘all cells above’ before running the train_step code?
I’m thinking this is universally true, since training depends on model state. If you execute training a second (or subsequent) time without reinitializing the parameters, you will get different results, because you are starting from a baseline of trained parameters instead of the initial, often random, values.
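For example, here is a minimal sketch of the effect with a toy Keras model (not the notebook’s code; the model and data are made up purely to illustrate):

```python
import tensorflow as tf

tf.random.set_seed(0)  # toy model, invented for illustration only
model = tf.keras.Sequential([tf.keras.layers.Dense(1)])
model.compile(optimizer="sgd", loss="mse")

x = tf.constant([[0.0], [1.0], [2.0]])
y = tf.constant([[1.0], [3.0], [5.0]])

# The first run starts from the random initial weights.
h1 = model.fit(x, y, epochs=3, verbose=0)
# A second fit() continues from the already-trained weights, so its
# epoch-0 loss differs from the first run's epoch-0 loss.
h2 = model.fit(x, y, epochs=3, verbose=0)
print(h1.history["loss"][0], h2.history["loss"][0])
# To truly start over you must rebuild the model, which is what
# rerunning the earlier notebook cells does.
```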
Not to put too fine a point on it, but what you’re telling us is that you simply have not been paying attention. All the notebooks have worked this way from the “get go”. They are stateful. But you lose the state anytime you close and reopen them or do a “kernel restart”. This was explained in Course 1 Week 2 when we first started on all this.
Of course statefulness also implies the point that Tom made: the order in which you run the cells or rerun them also matters in the general case.
Also note that there is this comment in the cell right after the one that defines train_step:
# Show the generated image at some epochs
# Uncoment to reset the style transfer process. You will need to compile the *train_step* function again
That is new. In the previous version, it simply said this:
# You always must run the last cell before this one. You will get an error if not.
Sigh. I will file a bug about the fact that they apparently can’t spell “comment”.
A TensorFlow Model has an attribute that is a collection of Layers. Each Layer has a collection of weights, which are initialized when the Layer is instantiated. Unless they are designated as non_trainable_weights, they are updated from their initial values during backprop. Those values persist in the Layer, and thus in the Model, after training completes, and remain there until the Model is garbage collected or otherwise explicitly reinitialized. Especially since some of the initializers depend on random numbers, it is a good idea to restart and rerun the entire notebook in order to obtain repeatable output from training.
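A quick way to see this for yourself (a hypothetical model, not the notebook’s; the layers were chosen only because BatchNormalization happens to carry non_trainable_weights):

```python
import tensorflow as tf

# Hypothetical model: BatchNormalization carries non_trainable_weights
# (its moving mean and variance), while Dense carries trainable ones.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(4, activation="relu"),
    tf.keras.layers.BatchNormalization(),
    tf.keras.layers.Dense(1),
])
model.build(input_shape=(None, 3))  # building the model initializes the weights

for layer in model.layers:
    print(layer.name,
          "trainable:", len(layer.trainable_weights),
          "non-trainable:", len(layer.non_trainable_weights))

# These weight variables live on the Layer objects and persist until
# the Model is garbage collected or the layers are rebuilt.
```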
Useful references: the tf.keras.layers.Layer documentation. See, for example, the attributes list and the get_weights() method, which you could use to examine the weights of one layer after successive train steps.
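A sketch of that idea (placeholder model and data, not the notebook’s; shown only to illustrate the inspection pattern):

```python
import numpy as np
import tensorflow as tf

model = tf.keras.Sequential([tf.keras.layers.Dense(1)])
model.build(input_shape=(None, 1))      # initializes the weights
model.compile(optimizer="sgd", loss="mse")

x = np.array([[0.0], [1.0]], dtype="float32")
y = np.array([[0.0], [2.0]], dtype="float32")

before = model.layers[0].get_weights()  # [kernel, bias] as numpy copies
model.train_on_batch(x, y)              # one training step
after = model.layers[0].get_weights()

# The same Layer object now holds updated values.
print(np.allclose(before[0], after[0]))  # False: the kernel changed
```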