Natural Language Processing & Word Embeddings

edwardyu · July 5, 2021, 4:10am

Let’s take a look at skip-gram with a simple network below.

vocab_size = 3000
embeding_size = 100
model = tf.keras.Sequential([
    tf.keras.Input(shape =(vocab_size,), name='context_word'),
    tf.keras.layers.Dense(embeding_size, use_bias=False, name='embedding'),
    tf.keras.layers.Dense(vocab_size, activation='softmax', name='target_word')
] name='skip-gram')
model.summary()

This is a simple version skip-gram model. Just like Andrew said in lecture, the input is a one-hot vector with vocabulary size (3000 in the case), the output of hidden layer (embedding layer) is an embedding vector with embedding size (100 in the case), the Parameter # of embedding layer is the size of embedding matrix E (3000 x 100 in the case), and the output of output layer is a target word (Parameter is theta.)
Hopefully, it’s helpful.

Topic		Replies	Views
Learning word embeddings Sequence Models coursera-platform	1	622	August 4, 2021
DLS5 W2 Learing Word embeddings Sequence Models coursera-platform	7	574	September 12, 2023
How do we obtain the embeddings from CBOW? Sequence Models coursera-platform	1	514	October 11, 2022
Understanding the skipgram model Sequence Models coursera-platform	1	634	May 13, 2021
How does the embedding matrix appear in a neural network Sequence Models coursera-platform	3	655	August 27, 2022

Natural Language Processing & Word Embeddings

Related topics