Number of LSTM units in Trax

arvyzukai · October 20, 2022, 7:58am

For the sake of completeness, I can share my own calculations to check the inner workings of this weeks C3_W3 assignment. Maybe someone will find it useful.

The example of a batch:

image1321×675 28.3 KB
The example of the Embedding weights:

image1055×984 144 KB
The first sentence embedded example:

image1014×720 93 KB

Note that the same words have the same embeddings (highlighted in blue and orange).
The example of LSTM input weights for first layer W_ih_l0:

image1161×1309 155 KB
The example of LSTM hidden state weights for fist layer W_hh_l0:

image1029×1290 128 KB
The example of LSTM biases (for both input and hidden state):

image799×1289 43.5 KB

The example of calculations :

t = 0 (“Thousands”)

image1685×1021 96.6 KB
t = 1 (“of”)

image1688×860 98.5 KB
t = 2 (“demonstrators”)

image1690×859 97.8 KB
t = 17 (note jump to step 18 - the word “of”)

image1692×862 94.3 KB

Note:

You can compare the different values between words “of” in step t=1 and step t=17. Note that inputs (the embeddings are the same) but because of different hidden states c_16 and h_16, the output is different.

The example output of LSTM for the first sentence:

The example of Linear (Dense) layer weights (W and b):

image1145×882 91.6 KB

Topic		Replies	Views
C4W1: Quick question - Number of LSTM units in the model NLP with Attention Models week-1	1	415	March 2, 2024
Model architecture: Embedding dimension size and GRU number of cells NLP with Sequence Models week-2	8	1142	January 3, 2023
Why are we allowed to choose the number of units of an LSTM layer? Sequences, Time Series and Prediction week-3	4	563	March 22, 2022
Questions on inputs for GRU model NLP with Sequence Models week-2	5	745	March 9, 2023
LSTM layer size confusion? NLP with Sequence Models week-1	1	580	January 8, 2023

Number of LSTM units in Trax

The example of calculations :

Note:

The example output of LSTM for the first sentence:

The output of the model:

Related topics