Hello, for some reason the train steps keep stopping at 1 and not continuing to 5. Why does it stop so early?
Thanks!
Adam
Hello, for some reason the train steps keep stopping at 1 and not continuing to 5. Why does it stop so early?
Thanks!
Adam
Hi @cmosguy
Add n_steps_per_checkpoint=1 (the number you want) in the training.TrainTask(...) (in the train_model function definition).
Cheers
@arvyzukai thank you for your reply. n_steps_for_checkpoint is a logging feature flag from what I can tell.
What I am trying to understand is what do I need to do to train the model for longer to make it more accurate? Your notebook assignment comes with a model.pkl.gz. What do I need to run the model longer to match what you pretrained the model to?
Thanks!
Oh, you did everything correct - more train_steps = more training. I though you wanted to see the progress more frequently
(default setting is 500 if I remember correctly).
I think the Coursera backend would not like you training too long
(it probably would take a long time and multiply that by amount of students … ).
You can try setting `lr=0.001’ and running for 500 steps would give you some decent model but you might not want to abuse it ![]()
Cheers
P.S.
Btw I realized, that depending on your experience with trax you might have troubles, so if you would want to try it yourself you should change these bits in the code:
In # UNQ_C4:
n_steps_per_checkpoint=50 inside the training.TrainTask(...Siamese variable inside training.Loop(... just use Siamese without bracketsIn the following cell:
train_steps = 750model to the training loop like: training_loop = train_model(model, .. (not the Siamese class as in the assignment)NOTE!!! : this will mess your assignment and you would not pass the grading, so experiment with a backup or when you passed the assignment. Again - this is not a solution for future readers ![]()
Hey @arvyzukai this was really helpful. Thanks for clarifying how to run the loop longer. It is much appreciated.