Why are the units of the dense layer the same as the size of the vocabulary?

Hello,

I don’t know whether I understood this correctly, so I wanted to confirm here.

The Dense layer with log-softmax activation in the Decoder uses vocab_size as its number of units, and the instructions give the reason as:

Finally a Dense layer. This one should have the same number of units as the size of the vocabulary since you expect it to compute the logits for every possible word in the vocabulary.

So is this because the number of hidden units is supposed to equal the vocab size, as I learnt from the instructor in the video, so that the translator is able to translate (decode) the words?

Is my understanding correct? If not, please enlighten me.

Thank you in advance.

Regards
DP


Hi @Deepti_Prasad,

Do you mean this unit in the translator: units (int): Number of units in the LSTM layer?


No, the Dense layer; that’s what the code mentioned. Do you want me to share the code? I didn’t share it since it is part of the graded code.

Thank you, gent.

Regards
DP


From what I understand, the final Dense layer has the same number of neurons as the vocab size, for the reasons you also mention!
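
To make that concrete, here is a minimal sketch of a decoder head in Keras. The names `units` and `vocab_size` follow the assignment's wording, but the values and the layer arrangement are illustrative assumptions, not the graded code:

```python
import tensorflow as tf

vocab_size = 12000   # size of the target vocabulary (example value)
units = 256          # LSTM hidden size, a tunable hyperparameter (example value)

decoder_head = tf.keras.Sequential([
    # The LSTM's `units` is independent of the vocabulary size.
    tf.keras.layers.LSTM(units, return_sequences=True),
    # The final Dense layer must have vocab_size units so it produces
    # one log-probability per word in the vocabulary.
    tf.keras.layers.Dense(vocab_size, activation=tf.nn.log_softmax),
])

x = tf.random.normal((2, 10, 64))   # (batch, timesteps, embedding_dim)
print(decoder_head(x).shape)        # (2, 10, 12000): one score per word
```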


It sounds reasonable to me too: the output layer has the same size as the vocab so that it can predict a probability for each word in the vocabulary.
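
As a toy illustration (a made-up five-word vocabulary with purely hypothetical numbers): the layer outputs one score per word, and the decoded word is the one with the highest score.

```python
import tensorflow as tf

# One log-probability per word of a toy 5-word vocabulary.
log_probs = tf.math.log(tf.constant([0.10, 0.60, 0.05, 0.05, 0.20]))

# The predicted token is the vocabulary index with the highest score.
predicted_id = int(tf.argmax(log_probs))
print(predicted_id)  # 1
```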


What @rmwkwok said is right. I would like to amend @Deepti_Prasad’s comment about the Dense layer being the hidden units: it is in fact the output layer of the Decoder.
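
A quick way to see that distinction is to print the shapes at each stage (example values again, assuming the same Keras layers as in the sketch above):

```python
import tensorflow as tf

units, vocab_size = 256, 12000       # example values
x = tf.random.normal((1, 10, 64))    # (batch, timesteps, embedding_dim)

hidden = tf.keras.layers.LSTM(units, return_sequences=True)(x)
output = tf.keras.layers.Dense(vocab_size, activation=tf.nn.log_softmax)(hidden)

print(hidden.shape)  # (1, 10, 256):   hidden units, a free design choice
print(output.shape)  # (1, 10, 12000): fixed by the vocabulary size
```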


So my understanding is correct. Thank you, everyone! :pray:t2:

Regards
DP
