In the function Emojify_V2(), we set the LSTM's number of units to 128 and the Dense layer's number of units to 5.
How is the number 128 derived? How is the number 5 derived? Is the number 5 the result of there being 5 emoji types?
Thanks in advance.
The size of the hidden state for any kind of sequence model (RNN, GRU, LSTM) is just a choice you make. If you pick too large a number, it may slow down your training a bit, but a bit of "overkill" is probably not a big problem.
For the Dense layer output, they tell you in the instructions for that section that it's a 5-class softmax output:
* The model outputs a softmax probability vector of shape (m, C = 5).
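To make the two numbers concrete, here is a minimal sketch of an Emojify-style model. It is not the exact assignment code: the function name build_emojify_like_model is made up, and it feeds word embeddings in directly instead of using the assignment's pretrained Embedding layer. The point is just that 128 is a freely chosen hidden-state size, while the final Dense layer must have 5 units because there are C = 5 emoji classes.

```python
from tensorflow.keras.layers import Input, LSTM, Dropout, Dense, Activation
from tensorflow.keras.models import Model

def build_emojify_like_model(input_shape, num_classes=5, hidden_units=128):
    """Simplified Emojify-style classifier (illustrative only).

    input_shape: (max_len, embedding_dim) -- assumes the word embeddings
    are fed in directly, unlike the assignment's Embedding layer.
    """
    X_input = Input(shape=input_shape)                      # (m, max_len, emb_dim)
    X = LSTM(hidden_units, return_sequences=True)(X_input)  # 128 is a design choice
    X = Dropout(0.5)(X)
    X = LSTM(hidden_units, return_sequences=False)(X)       # keep only the last hidden state
    X = Dropout(0.5)(X)
    X = Dense(num_classes)(X)                               # 5 units = 5 emoji classes
    X = Activation("softmax")(X)                            # probability vector of shape (m, 5)
    return Model(inputs=X_input, outputs=X)

model = build_emojify_like_model(input_shape=(10, 50))
model.summary()  # final layer output shape: (None, 5)
```

If you changed the dataset to have, say, 8 emoji classes, only the Dense layer's unit count would need to change; the 128 could stay the same or be tuned for speed/accuracy.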