C5, W1, A2 - dinosaurs character language modelling

Akingbeni_David · January 1, 2023, 6:51pm

I’m struggling to understand the instruction that is set for the last question about training the model, shown below:

When examples[index] contains one dinosaur name (string), to create an example (X, Y), you can use this:

Set the index `idx` into the list of examples

Using the for-loop, walk through the shuffled list of dinosaur names in the list “examples.”
For example, if there are n_e examples, and the for-loop increments the index to n_e onwards, think of how you would make the index cycle back to 0, so that you can continue feeding the examples into the model when j is n_e, n_e + 1, etc.
Hint: (n_e + 1) % n_e equals 1, which is otherwise the ‘remainder’ you get when you divide (n_e + 1) by n_e.
% is the modulo operator in python.

Akingbeni_David · January 1, 2023, 8:05pm

I already figured this out. In case anyone had the same challenge at the beginning, kindly take note of the following:

The code expects a single training sample per iteration and not the whole dataset. This is unlike a standard feed-forward network, where each iteration takes in the whole dataset in mini-batches.
However eventually with the number of iterations (35K), the network ends up going through the samples more than once. Think of it as having a batch size of the whole dataset.

I hope I am correct about this.

UPDATE: I am still yet to obtain the correct answer. I now have the model outputting some interesting results. The loss I am outputting is actually lower than that of the grader. I cannot seem to place where the error is. I have some cool dinosaurs name too:

The last lines of my output look something like this:

Iteration: 20000, Loss: 21.056823

Rixtstapnosaurus
Miceadsomabosaurus
Owutoosaurus
Rabaessaacitatornythaycerogavsaurus
Zuromibosaurus
Haadropcarus
Yuocheroptosaurus

Iteration: 22000, Loss: 20.578871

Hutusaurus
Euca
Eustrioppn
Hocamptopanceus
Xuspeodon
Elacropechus
Uspeodon

paulinpaloalto · January 1, 2023, 9:11pm

One common mistake is to use the direct inputs, as opposed to using the “shuffled” version that they generate for you in the template code.

Akingbeni_David · January 2, 2023, 1:29am

Thanks @paulinpaloalto

Just adjusted the code and it worked.

Topic		Replies	Views
Dinosaur Island Exercise 4 model Sequence Models	1	635	March 3, 2022
Dinosaur assignment - model function Sequence Models	1	470	August 22, 2023
C5-W-1A2, function 'model' got passing the tests Sequence Models week-1	4	40	August 14, 2024
C5W1A2 Dinosaurus - model() - idx initialisation Sequence Models	1	705	June 30, 2022
C5 W1 A2 Dinosaur Island - "Reasonable" but wrong output names Sequence Models	3	560	February 6, 2023

C5, W1, A2 - dinosaurs character language modelling

Set the index idx into the list of examples

Related topics

Set the index `idx` into the list of examples