Simple GRU initialization not working

arvyzukai · July 31, 2023, 3:56pm

The lab implementation is actually very accurate, it’s just not exactly the same as in the trax library. Depending on what are you trying to achieve, you can use the lab implementation to your original quest and everything should work and you wouldn’t need to read the code.

But if you want to understand the trax implementation, there is just simply too much to explain in a single post. I would encourage you to read the code line by line.

Here you have:

weights[0][0], shape (10, 10) - Update and reset gates weights (combined)
weights[0][1], shape (10, ) - Update and reset gates bias
weights[0][2], shape (10, 5) - Candidate weights
weights[0][3], shape (5, ) - Candidate bias

Cheers

Topic		Replies	Views
Coding concerning stacking GRUs NLP with Sequence Models week-2	5	613	January 16, 2023
Creating a GRU model using Trax NLP with Sequence Models week-2	3	734	July 26, 2022
Course 4 Week 1 Ex 6: ValueError NLP with Attention Models week-1	10	575	September 6, 2023
Error with next_symbol (UNQ_C6) NLP with Attention Models week-1	3	448	July 3, 2023
C3_W4 UNQ_C5 : problem with loading the weights NLP with Sequence Models week-4	10	741	October 25, 2023

Simple GRU initialization not working

Related topics