W1 A2 How is the model able to learn?

jakhon77 · December 15, 2024, 10:38pm

In the “Dinosaurus Island Character Level Language Model” assignment, HOW is the model learning when the loss remains almost the same as the initial loss, around 23?

balaji.ambresh · December 16, 2024, 3:45am

This lab is meant for learning purposes on sequence modeling (character level). As stated in the markdown at the end of the notebook, please experiment with different hyperparameters for better results.

That said, have you seen this?

jakhon77 · December 17, 2024, 3:58am

Thank you! However, I’m not worried about the model’s performance, as it is performing fine with the current hyperparameters. What I’m curious about is how the model is learning this well even when the loss does not decrease. If you take a look at the loss during the training it stays around 23. Nevertheless, the model still learns well.

TMosh · December 17, 2024, 4:03am

I believe the loss is decreasing, just not very much. Because if it didn’t decrease, the accuracy would not increase. In classification, Cost and Accuracy both measure essentially the same thing.

Topic		Replies	Views
Dinosaurus_Island_Character_level_language_model Exercise 4 Sequence Models week-1	3	25	October 17, 2024
Conceptual aspect of Dinosaurus Island -- Character level language Sequence Models	1	635	September 10, 2021
A doubt in Week 1 Assignment Sequence Models	3	291	December 11, 2023
Week 1 Assignment: Dinosaurus_Island_Character_level_language_model Sequence Models	1	516	January 15, 2023
Character level language model - Dinosaurus Island - results are not matching Sequence Models week-1	2	12	January 19, 2025

W1 A2 How is the model able to learn?

Related topics