How does the trax word embedding layer work?

The Embedding code is very simple (see the Embedding layer in the trax source).

As you can see, the forward propagation is just one line of code:

jnp.take(self.weights, x, axis=0)

What it does is simply “take” the rows of the weight matrix (self.weights) indexed by x. So if you have an embedding matrix for a vocabulary of size 20 with embedding dimension 4 (shape (20, 4)):
[image: the (20, 4) embedding weight matrix]

and you pass a batch of two sentences, each of length 4 (i.e. x of shape (2, 4)):
[image: the (2, 4) batch of token ids]

the Embedding layer will return an output of shape (2, 4, 4) - it adds one more dimension, the embedding-size dimension:
[image: the (2, 4, 4) output of the lookup]
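
If you want to see this with concrete numbers, here is a minimal sketch of that lookup done with jax.numpy directly (the weight values and token ids below are made up just for illustration):

```python
import jax.numpy as jnp
from jax import random

# Toy embedding matrix: vocab size 20, embedding dimension 4, as in the example above.
weights = random.normal(random.PRNGKey(0), (20, 4))

# A batch of two "sentences", each 4 token ids long -> shape (2, 4).
x = jnp.array([[1, 2, 3, 4],
               [5, 6, 7, 8]])

# The whole forward pass: pick the x'th rows of the weight matrix.
out = jnp.take(weights, x, axis=0)
print(out.shape)  # (2, 4, 4)
```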

This is all it does - it takes an input of shape (batch_size, seqlen) and outputs a tensor of shape (batch_size, seqlen, emb_size).
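
For completeness, here is a sketch of the same thing through the trax layer itself (assuming a recent trax version; the init/signature step is just the usual boilerplate a trax layer needs before it can be called, and the token ids are again made up):

```python
import numpy as np
import trax.layers as tl
from trax import shapes

emb = tl.Embedding(vocab_size=20, d_feature=4)

x = np.array([[1, 2, 3, 4],
              [5, 6, 7, 8]])       # (batch_size=2, seqlen=4)

emb.init(shapes.signature(x))      # creates self.weights with shape (20, 4)
y = emb(x)
print(y.shape)                     # (2, 4, 4)
```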
