Model embedding layer dim

Hi,

In Lab1 I tried dim = 8, same result.
How to choose correct embedding layer dim?

Thanks

Hello @Taras_Buha

Thanks for reaching out.

There is no single "right" answer to this question; there are many views on how to choose the embedding dimension.

For example, this Google developer blog post says:

Well, the following “formula” provides a general rule of thumb about the number of embedding dimensions:

embedding_dimensions =  number_of_categories**0.25

The embedding vector dimension should be the 4th root of the number of categories.
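As a quick sketch of that rule of thumb (the function name here is just illustrative, not from any library):

```python
# Rule of thumb from the Google developer blog quoted above:
# embedding_dimensions ~= number_of_categories ** 0.25 (the 4th root).
def suggest_embedding_dim(number_of_categories: int) -> int:
    """Return the rule-of-thumb embedding size, rounded to the nearest integer."""
    return round(number_of_categories ** 0.25)

# For example, a vocabulary of 10,000 tokens:
print(suggest_embedding_dim(10_000))  # -> 10
```

So for Lab1, you could start from this value and then tune up or down based on validation performance.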

The most important thing is to keep the following guidelines in mind:

  1. The embedding layer is a compression of the input; the smaller the layer, the more you compress and the more information you lose. The larger the layer, the less you compress, but you risk overfitting your input dataset to this layer, making it useless.
  2. If your documents are very sparse relative to the vocabulary, you want to "get rid" of unnecessary and noisy words - compress more by making the embedding smaller.
  3. The more extensive your vocabulary, the better the representation you want - make the layer larger.

These are some recommendations from Tolik; note that they are just general guidelines - you can set the number of embedding dimensions however you please.

Hope this helps :muscle:

With regards,


Hello @adonaivera

Thanks very much for such helpful information.
Very helpful answer.

Best regards, Taras
