Loss funtion in dinosaurus_Island assignment

Soumitra_Das · October 6, 2022, 1:17am

What is the loss function in the dinosaur assignment?

TMosh · October 6, 2022, 1:37am

You can determine this by reading the rnn_forward() function in the utils.py file.

Soumitra_Das · October 6, 2022, 2:01am

Yes, I have seen that but its not clear to me.[It does not look like the one described in the lecture] Can you please write the exact mathematical formula.

paulinpaloalto · October 6, 2022, 11:23pm

Forward propagation uses softmax at the output layer, so it is the standard cross entropy loss that is used with softmax. At each timestep it is:

L(y,\hat{y}) = -y * log(\hat{y})

The only other subtlety is that they are summing the loss across all the timesteps. The comment could be a little clearer there: they call it subtraction, but the terms are negative so you’re really adding them. Well, they actually call it “substraction” (sic).

All the indexing business is just selecting the element of the vector that corresponds to the one hot label at that timestep. It all boils down to -log(\hat{y}) for the element that corresponds to the “true” label for that timestep. Which is absolutely “flavor vanilla” cross entropy loss …

Topic		Replies	Views
Conceptual aspect of Dinosaurus Island -- Character level language Sequence Models	1	635	September 10, 2021
Dinosaurus_Island_Character_level_language_model optimize Sequence Models week-1	5	490	January 7, 2024
Week 1 questions Sequence Models	1	525	December 26, 2021
Loss Function of Week 3 Neural networks topic Neural Networks and Deep Learning	4	646	February 12, 2024
Week 1 RNN Concepts Sequence Models	5	566	May 29, 2023

Loss funtion in dinosaurus_Island assignment

Related topics