Why multiply a one-hot vector by the word embedding matrix when we can just extract the column?

This lecture covers the uses of word embeddings in our projects. My question: if we need the column for a word, why do we matrix-multiply the embedding matrix by the word's one-hot encoding instead of just extracting the column? We know exactly which column the output is going to be, so why do the unnecessary computation?
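
For concreteness, here is a minimal NumPy sketch of the equivalence I mean (the vocabulary and embedding sizes are made up for illustration):

```python
import numpy as np

vocab_size, embed_dim = 5, 3  # made-up sizes for illustration
rng = np.random.default_rng(0)
E = rng.standard_normal((embed_dim, vocab_size))  # embedding matrix, one column per word

idx = 2                        # position of the word in the vocabulary
one_hot = np.zeros(vocab_size)
one_hot[idx] = 1.0

# Multiplying by the one-hot vector selects exactly one column...
via_matmul = E @ one_hot
# ...which is the same as extracting that column directly.
via_indexing = E[:, idx]

assert np.allclose(via_matmul, via_indexing)
```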


The one-hot encoding in the vocabulary is the traditional way of identifying a word without using its index value directly. Using the index value would introduce unintended and misleading similarity between words. For example, if the vocabulary is in alphabetical order, then “ape”, “apostrophe”, and “apple” would appear to have similar numerical meanings, even though they are very different words.
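
To make this concrete, here is a small sketch of what goes wrong when the raw index is fed in as a number (the words and their alphabetical positions are just for illustration):

```python
import numpy as np

# Hypothetical alphabetical vocabulary positions, for illustration only
words = {"ape": 0, "apostrophe": 1, "apple": 2, "zebra": 25}

# Treating the index itself as a numeric feature implies an ordering:
# "ape" looks much closer to "apostrophe" than to "zebra", even though
# alphabetical position says nothing about meaning.
print(abs(words["ape"] - words["apostrophe"]))  # 1
print(abs(words["ape"] - words["zebra"]))       # 25

# With one-hot vectors, every pair of distinct words is equally far apart:
vocab_size = 26

def one_hot(i):
    v = np.zeros(vocab_size)
    v[i] = 1.0
    return v

print(np.linalg.norm(one_hot(0) - one_hot(1)))   # sqrt(2)
print(np.linalg.norm(one_hot(0) - one_hot(25)))  # sqrt(2)
```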

I have a similar question and still haven’t quite got it.

Doesn’t the model only care about the extracted output rather than the indices? As a simple example, I think this is analogous to declaring a simple list of lists lst, where lst[0] = [5, 6, 7] and lst[1] = [8, 9, 0]. When using direct indexing, the indices 0 and 1 look similar, yes, but the extracted values [5, 6, 7] and [8, 9, 0] are completely different, and the code works with the inner values, not the index positions.
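
In code, the analogy I have in mind (the values are arbitrary):

```python
lst = [[5, 6, 7], [8, 9, 0]]

# The indices 0 and 1 are "close" as numbers, but downstream code only
# ever sees the extracted values, which can be completely different:
print(lst[0])  # [5, 6, 7]
print(lst[1])  # [8, 9, 0]
```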

Given that the embedding matrix is learnable, that the indexing operation (or, equivalently, the multiplication by the one-hot encoding) extracts one specific column from it, and that the one-hot encoding is fixed across the training session, I don’t see any reason why direct indexing would cause misleading similarity.
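
For what it’s worth, this also seems to be how frameworks implement embedding layers in practice: as a learnable lookup table rather than a one-hot matrix multiplication. A quick PyTorch check of the equivalence (assuming PyTorch is installed; note nn.Embedding stores one row per word rather than one column):

```python
import torch
import torch.nn as nn

vocab_size, embed_dim = 5, 3
emb = nn.Embedding(vocab_size, embed_dim)  # learnable lookup table, shape (5, 3)

idx = torch.tensor([2])
one_hot = torch.zeros(vocab_size)
one_hot[2] = 1.0

# Direct lookup (what the layer actually does)...
via_lookup = emb(idx).squeeze(0)
# ...matches the one-hot matrix multiplication.
via_matmul = one_hot @ emb.weight

assert torch.allclose(via_lookup, via_matmul)
```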

Could you please elaborate more on this point or give some other reasons for the original question? Thank you in advance.