C3W1 - Confused with Embedding and Mean Layers

I am confused by the NLP C3W1 introduction. I don’t understand how the embedding for each word can have multiple dimensions, or why we need to take a mean over them (per the discussion at https://community.deeplearning.ai/t/trax-mean-layer/230473, the embedding size is 2, with each column representing a different embedding feature). From C2, an embedding is a column or row of a weight matrix, either the first or the second one, assuming we use two layers.

Does the instruction imply that we use a two-layer NN and take the mean of these two layers?

I don’t understand why the embedding size is 2.

The “embedding size = 2” does not mean the model has two layers. It simply means each word is represented by a 2-dimensional embedding vector, chosen so the course can easily visualize the embedding space.

Every word becomes a point in a 2D space, and the Mean layer computes the average of these vectors across all words in a sentence, producing one fixed-size representation. This averaged vector is then passed to a simple dense layer for classification. The key idea is that we are averaging features, not averaging layers — the embedding dimension is just the size of the word vectors.
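To make this concrete, here is a minimal NumPy sketch of the same idea (the course uses Trax, but the vocabulary size, word IDs, and random embedding values below are made up for illustration): each word ID looks up one 2-dimensional row of the embedding matrix, and the mean is taken across the words of the sentence, not across layers.

```python
import numpy as np

# Toy setup (hypothetical numbers): 5-word vocabulary, embedding size 2
vocab_size, embed_dim = 5, 2
rng = np.random.default_rng(0)
embedding_matrix = rng.normal(size=(vocab_size, embed_dim))  # one 2-D vector per word

sentence = np.array([1, 3, 4])        # word IDs of a 3-word sentence
vectors = embedding_matrix[sentence]  # shape (3, 2): one 2-D row per word

# The "Mean layer": average over the word axis -> one fixed-size vector
sentence_vector = vectors.mean(axis=0)  # shape (2,)

print(vectors.shape)          # (3, 2)
print(sentence_vector.shape)  # (2,)
```

However long the sentence is, the result is always a single vector of length 2, which is why it can be fed to a dense classification layer.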


got it, thanks


The model already knows the right answer. The problem is stability under compression. Compression-Aware Intelligence (CAI) measures that.

Language models don’t contain “right answers”. They contain correlations between sequences of words.
