Course 5 week1 dinosaur model, loss exploding after 5 iterations

marcseb · May 21, 2021, 5:02pm

Hi, the model seems to run, but the loss explodes after a few iterations. Here are the outputs:
single_example = turiasaurus
single_example_chars ['t', 'u', 'r', 'i', 'a', 's', 'a', 'u', 'r', 'u', 's']
single_example_ix [20, 21, 18, 9, 1, 19, 1, 21, 18, 21, 19]
X = [None, 20, 21, 18, 9, 1, 19, 1, 21, 18, 21, 19]
Y = [20, 21, 18, 9, 1, 19, 1, 21, 18, 21, 19, 0]
…

curr loss= 58.12372726116307
curr loss= 50.29752492287171
curr loss= 151.48395730840713
curr loss= 1566.987862638166
curr loss= 4024.7637708785633
curr loss= inf
curr loss= inf

The call to the optimizer seems OK:
curr_loss, gradients, a_prev = optimize(X, Y, a_prev, parameters, learning_rate = 0.01)

Can anyone help me with this?
Many thanks in advance,
Marc

marcseb · May 24, 2021, 4:41pm

Hi,
Even though this assignment has already been graded successfully (but only at 75/100), I would like to know where my code fails. Which staff member could I send my code to so that I can understand my error?
Thanks
Marc

edwardyu · May 25, 2021, 1:27am

Hi,
Could you DM me your code? Thanks.

marcseb · May 25, 2021, 12:14pm

Hi Edward,

Thank you for your prompt answer. Please find attached my code.

Best regards

Marc


(Attachment Dinosaurus_Island_Character_level_language_model.json is missing)

bleg · May 26, 2021, 2:39pm

I ran into that too.
It turns out your optimize() function updates the parameters in the wrong direction along the gradient (the tests do not check for this).
Flip the sign of the update and you'll be good.
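
To illustrate why the sign matters, here is a minimal sketch on a toy problem. It is not the assignment's code: the helper names (update_parameters, loss_and_grads, run), the `sign` argument, and the quadratic loss are all hypothetical and chosen only for demonstration. With the minus sign the loss shrinks at every step; with the plus sign it grows at every step, which is the same pattern as the exploding loss reported above.

```python
import numpy as np

def update_parameters(parameters, gradients, learning_rate, sign=-1.0):
    """One gradient step on every parameter (hypothetical helper).

    sign=-1.0 is ordinary gradient descent; sign=+1.0 reproduces the bug
    of stepping in the direction that increases the loss.
    """
    return {name: value + sign * learning_rate * gradients["d" + name]
            for name, value in parameters.items()}

def loss_and_grads(parameters):
    # Toy loss 0.5 * ||W||^2, whose gradient with respect to W is W itself.
    W = parameters["W"]
    return 0.5 * float(np.sum(W ** 2)), {"dW": W}

def run(sign, steps=5, learning_rate=0.5):
    # Record the loss before each update so the trend is easy to inspect.
    parameters = {"W": np.array([3.0, -4.0])}
    losses = []
    for _ in range(steps):
        loss, grads = loss_and_grads(parameters)
        losses.append(loss)
        parameters = update_parameters(parameters, grads, learning_rate, sign)
    return losses

descent_losses = run(sign=-1.0)  # correct direction: loss decreases
ascent_losses = run(sign=+1.0)   # flipped sign: loss increases

print("descent:", descent_losses)
print("ascent: ", ascent_losses)
```

If your own update step looks like the flipped-sign case (adding learning_rate times the gradient instead of subtracting it), that alone could be enough to make the loss reach inf within a few iterations.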

marcseb · May 26, 2021, 4:01pm

Thank you, Edward and Gleb, for your support. It works fine now.

Mhmemeth · August 29, 2021, 4:21pm

Hi, my problem is the opposite: instead of exploding, the loss seems to drop very quickly. All tests pass before exercise 4. The algorithm starts fine, then the loss drops immediately. Is there anywhere I can get help, so I can see what I'm doing wrong? I have reloaded the workspace twice, tried rebooting the server, and restarted the kernel and cleared the output, but the problem persists.

j = 0 idx = 0
single_example = turiasaurus
single_example_chars ['t', 'u', 'r', 'i', 'a', 's', 'a', 'u', 'r', 'u', 's']
single_example_ix [20, 21, 18, 9, 1, 19, 1, 21, 18, 21, 19]
X = [None, 20, 21, 18, 9, 1, 19, 1, 21, 18, 21, 19]
Y = [20, 21, 18, 9, 1, 19, 1, 21, 18, 21, 19, 0]

Iteration: 0, Loss: 23.087336

Nkzxwtdmfqoeyhsqwasjkjvu
Kneb
Kzxwtdmfqoeyhsqwasjkjvu
Neb
Zxwtdmfqoeyhsqwasjkjvu
Eb
Xwtdmfqoeyhsqwasjkjvu

j = 1535 idx = 2
j = 1536 idx = 3
Iteration: 2000, Loss: 9.894264

Iavosaurus
Dosaurus
Esitosaurus
Iaecerus
Unus
Amanonelhoneris
Tosaurus

…

Iteration: 22000, Loss: 0.294620

Iavesaqr
Esitoriasaurus
Esitoriasaurus
Iaeaurus
Urus
Andoravenator
Saurur

Thanks, Mark N.

TMosh · August 30, 2021, 3:23am

@Mhmemeth, your post is a duplicate; I replied on your other thread.
