Run the following cell to create your model and check its summary.
- Because all sentences in the dataset are less than 10 words, `max_len = 10` was chosen.
- You should see that your architecture uses 20,223,927 parameters, of which 20,000,050 (the word embeddings) are non-trainable, with the remaining 223,877 being trainable.
- Because your vocabulary has 400,001 words (with valid indices from 0 to 400,000), there are 400,001 * 50 = 20,000,050 non-trainable parameters.
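For anyone who wants to see where that split comes from, here is a minimal sketch, assuming TensorFlow/Keras and 50-dimensional GloVe vectors. The LSTM/Dense layer sizes below are one architecture that reproduces these exact counts, not necessarily the assignment's code:

```python
import numpy as np
import tensorflow as tf

vocab_size = 400001  # 400,000 GloVe words + 1, valid indices 0..400,000
emb_dim = 50
max_len = 10

# Placeholder for the pre-trained embedding matrix; in practice this is
# filled with the GloVe vectors before the model is built.
emb_matrix = np.zeros((vocab_size, emb_dim))

model = tf.keras.Sequential([
    tf.keras.Input(shape=(max_len,)),
    # Frozen embeddings: 400,001 * 50 = 20,000,050 non-trainable parameters
    tf.keras.layers.Embedding(
        input_dim=vocab_size,
        output_dim=emb_dim,
        embeddings_initializer=tf.keras.initializers.Constant(emb_matrix),
        trainable=False,
    ),
    tf.keras.layers.LSTM(128, return_sequences=True),  # 4*(50+128+1)*128 = 91,648 trainable
    tf.keras.layers.Dropout(0.5),
    tf.keras.layers.LSTM(128),                         # 4*(128+128+1)*128 = 131,584 trainable
    tf.keras.layers.Dropout(0.5),
    tf.keras.layers.Dense(5, activation="softmax"),    # 128*5 + 5 = 645 trainable
])
model.summary()  # total: 20,223,927; trainable: 223,877; non-trainable: 20,000,050
```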
Hi @krithika_govindaraj
The model uses word embeddings with a vocabulary of 400,001 words, each represented by a 50-dimensional vector, which gives 400,001 * 50 = 20,000,050 non-trainable parameters for the embedding layer. The model has 20,223,927 parameters in total, so the remaining 223,877 are trainable; these are the weights and biases (W, b) of the other layers. `max_len = 10` was chosen because the sentences in the dataset have a maximum length of 10 words. In this setup the embeddings are fixed and are not updated during training, so training focuses on the remaining layers.
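If you want to double-check the split yourself, a generic Keras check over the model's weight lists (not part of the assignment) works:

```python
# Count trainable vs. non-trainable parameters of the model defined above
trainable = sum(tf.keras.backend.count_params(w) for w in model.trainable_weights)
frozen = sum(tf.keras.backend.count_params(w) for w in model.non_trainable_weights)
print(f"trainable: {trainable:,}, non-trainable: {frozen:,}")
# expected: trainable: 223,877, non-trainable: 20,000,050
```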
Hope this helps! Feel free to ask if you need further assistance.