Difference in GRULM implementation and LSTM

Hi @Ming_Wei_H

A common mistake is mixing up what a “layer” is and what a “unit” is. In the GRULM there are 2 “layers” of GRU with 512 “units” each, while in the LSTM model there is 1 “layer” with 50 “units” (see the sketch below).
The terminology is confusing; if you want to learn more about it, you can read this post, and if it is still confusing - don’t worry.
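If it helps, here is a minimal sketch of the two configurations. It uses Keras rather than the framework from the assignment, and the vocabulary size (256) is just a placeholder I picked for illustration, but the layer/unit structure is the same idea:

```python
import tensorflow as tf

# GRULM-like stack: 2 GRU *layers*, each with 512 *units*.
grulm_like = tf.keras.Sequential([
    tf.keras.layers.Embedding(input_dim=256, output_dim=512),  # 256 = placeholder vocab size
    tf.keras.layers.GRU(512, return_sequences=True),  # layer 1, 512 units
    tf.keras.layers.GRU(512, return_sequences=True),  # layer 2, 512 units
])

# LSTM model: 1 LSTM *layer* with 50 *units*.
lstm_like = tf.keras.Sequential([
    tf.keras.layers.Embedding(input_dim=256, output_dim=50),
    tf.keras.layers.LSTM(50, return_sequences=True),  # 1 layer, 50 units
])
```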

Another common misconception is that an RNN’s number of units depends on the sentence length. That is not true. The number of units is the dimensionality of the vectors the RNN layer receives and produces at each time step. In the GRULM case, the inputs are 512-dimensional vectors and the outputs are also 512-dimensional vectors (each “unit” produces its own output). So the number of units is simply the size of the output you want from the layer, regardless of how long the sequence is - you can see this directly in the sketch below.
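Here is a quick way to convince yourself of that (again a Keras sketch, with made-up input lengths): the last dimension of the output equals the number of units no matter how many time steps you feed in.

```python
import numpy as np
import tensorflow as tf

gru = tf.keras.layers.GRU(512, return_sequences=True)  # 512 units

# Two batches with different sequence lengths (10 vs 100 time steps).
short_batch = np.random.rand(1, 10, 512).astype("float32")
long_batch = np.random.rand(1, 100, 512).astype("float32")

print(gru(short_batch).shape)  # (1, 10, 512)  -> last axis = 512 units
print(gru(long_batch).shape)   # (1, 100, 512) -> still 512; length doesn't change the units
```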

I recently answered a similar question with concrete calculations, which you might find informative (or confusing 🙂).

Cheers