Skip Grams & Negative Sampling Softmax function

The formula is a 10,000-way (the vocabulary size) softmax classification at the output. The input is the context word's one-hot vector, the hidden layer is the embedding vector (the weights between the input and hidden layers form the embedding matrix), the output is a vector of target-word probabilities, and the weights between the hidden layer and the output layer are the parameters Theta. Here is a drawing that may help you understand skip-grams.
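
For reference, that softmax can be written with $e_c$ as the embedding of the context word $c$ and $\theta_t$ as the output weights for target word $t$ (10,000 being the vocabulary size):

$$
p(t \mid c) = \frac{e^{\theta_t^\top e_c}}{\sum_{j=1}^{10{,}000} e^{\theta_j^\top e_c}}
$$

And here is a minimal NumPy sketch of that forward pass, assuming a 10,000-word vocabulary and a 300-dimensional embedding (the names `E`, `theta`, and `context_idx` are just illustrative, not from the course code):

```python
import numpy as np

vocab_size, embed_dim = 10_000, 300

# Embedding matrix (input -> hidden weights) and output-layer weights Theta
E = np.random.randn(vocab_size, embed_dim) * 0.01
theta = np.random.randn(vocab_size, embed_dim) * 0.01

def skipgram_softmax(context_idx):
    """Probability of every target word given one context word."""
    e_c = E[context_idx]           # the one-hot input just selects one row of E
    logits = theta @ e_c           # theta_t^T e_c for every target word t
    logits -= logits.max()         # subtract max for numerical stability
    probs = np.exp(logits) / np.exp(logits).sum()
    return probs                   # shape (vocab_size,), sums to 1

probs = skipgram_softmax(context_idx=42)
print(probs.shape, probs.sum())    # (10000,) 1.0
```

The denominator sums over all 10,000 words, which is exactly the cost that negative sampling avoids.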
