Train steps stop at 1, while it is set to 5 (UNQ_C4)

cmosguy · February 23, 2023, 3:43pm

Hello, for some reason the train steps keep stopping at 1 and not continuing to 5. Why does it stop so early?

Thanks!
Adam

cmosguy · February 23, 2023, 3:47pm

BTW, when I re-run the same cell again, I do not get any outputs:

arvyzukai · February 24, 2023, 5:45am

Add n_steps_per_checkpoint=1 (the number you want) in the training.TrainTask(...) (in the train_model function definition).

Cheers

cmosguy · February 24, 2023, 3:21pm

@arvyzukai thank you for your reply. n_steps_for_checkpoint is a logging feature flag from what I can tell.

What I am trying to understand is what do I need to do to train the model for longer to make it more accurate? Your notebook assignment comes with a model.pkl.gz. What do I need to run the model longer to match what you pretrained the model to?

Thanks!

arvyzukai · February 24, 2023, 3:56pm

@cmosguy

Oh, you did everything correct - more train_steps = more training. I though you wanted to see the progress more frequently (default setting is 500 if I remember correctly).

I think the Coursera backend would not like you training too long (it probably would take a long time and multiply that by amount of students … ).

You can try setting `lr=0.001’ and running for 500 steps would give you some decent model but you might not want to abuse it

Cheers

P.S.
Btw I realized, that depending on your experience with trax you might have troubles, so if you would want to try it yourself you should change these bits in the code:
In # UNQ_C4:

add n_steps_per_checkpoint=50 inside the training.TrainTask(...
do not initialize the Siamese variable inside training.Loop(... just use Siamese without brackets

In the following cell:

modify train_steps = 750
pass the model to the training loop like: training_loop = train_model(model, .. (not the Siamese class as in the assignment)

NOTE!!! : this will mess your assignment and you would not pass the grading, so experiment with a backup or when you passed the assignment. Again - this is not a solution for future readers

cmosguy · March 7, 2023, 3:07pm

Hey @arvyzukai this was really helpful. Thanks for clarifying how to run the loop longer. It is much appreciated.

Topic		Replies	Views
C3_W1_Assignment - Exercise 6 - train_model NLP with Sequence Models week-1	2	507	April 24, 2023
C3_W2: Exercise 4 test function running infinitely NLP with Sequence Models week-2	2	441	July 13, 2023
Training Siamese Network for more than 2 epochs in C3W3 NLP with Sequence Models week-3	8	48	December 13, 2024
C3W3_Assignment - train_model NLP with Sequence Models week-3	2	472	February 1, 2024
C3_W1 excercise 6: Encountered an unexpected tracer NLP with Sequence Models week-1	3	599	March 7, 2022

Train steps stop at 1, while it is set to 5 (UNQ_C4)

Related topics