Trax and mean layer

Xixi_NXCR · December 11, 2021, 8:01pm

The initial equation is Z = WX + b, so W has dimension of Nh, where is N is the size of vocabulary and h is the size of embedding. In trax at one point in the assignment, the dense layer becomes x * w, which changes the dimension of w. But the text did not explicitly define the dimension of w. Later in the mean layer, the mean is on axis=1, and it requires average across N, which means now w is of dimension (h, N). Can someone please help that I understand this correctly. Thanks.

reinoudbosch · July 1, 2022, 9:20pm

Hi Xixi_NXCR,

In case you still have this question: As far as I can see, you are correct in your understanding.

Remington_Lambie · November 25, 2022, 7:43pm

I am trying to understand this as well. Is the output dimension of the mean layer (h, 1)? Where h is the embedding size. This would mean that we get for each word in our vocabulary the mean of all its embedding values of that word that have been altered from the weights and bias from passing through the embedding layer. Is that correct?

reinoudbosch · November 28, 2022, 2:12am

Hi Remington_Lambie,

As I understand it, the output dimension of the embedding layer is (batch_size, vocab_size, embedding_dim). Mean takes the average over the vocab_size (axis =1), so you end up with (batch_size, embedding_dim).

Akshay_Kumar_Pansari · December 3, 2022, 1:49pm

Thanks, this helped me

Topic		Replies	Views
Subtle, confusing errors in C3W1 notebook explanation of Mean NLP with Sequence Models week-1	2	487	March 28, 2023
Trax mean layer NLP with Sequence Models week-3	3	487	November 15, 2022
Question about Dimension of Model Input NLP with Sequence Models week-1	1	510	August 10, 2022
Mean Layer in C3_W2_Assignment NLP with Sequence Models week-2	4	466	May 23, 2023
Typo in Week's assignment notebook NLP with Sequence Models week-1	1	528	April 10, 2023

Trax and mean layer

Related topics