In the Negative Sampling lecture on learning word embeddings, Andrew describes it as multiple binary classification problems rather than a single softmax problem.

As per Andrew, the above image represents the training dataset. But to me it looks like a single binary classification problem, i.e., given the “Context” and the “Word”, the target is either 1 or 0.
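To illustrate how I am reading that training set, here is a minimal sketch of how I think the rows are generated (the toy vocabulary, the value of k, and the function name are my own assumptions, not from the lecture): one positive (context, word) pair with label 1, plus k randomly sampled negative pairs with label 0.

```python
import random

# Toy vocabulary and k are my own choices, not values from the lecture.
vocab = ["orange", "juice", "king", "book", "the", "of"]
k = 4  # number of negative samples per positive pair

def make_rows(context, word):
    """One positive row (label 1) plus k negative rows (label 0) for a given context word."""
    rows = [(context, word, 1)]
    for _ in range(k):
        # Negative "words" are sampled from the vocabulary at random.
        rows.append((context, random.choice(vocab), 0))
    return rows

print(make_rows("orange", "juice"))
# e.g. [('orange', 'juice', 1), ('orange', 'king', 0), ('orange', 'of', 0), ...]
```

Each row on its own looks like a single binary example, which is why the whole dataset reads to me as one binary classification problem.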

I realized I was right after seeing Andrew write the formula mentioned above.
But while drawing its neural network representation, Andrew said it is a multiple-binary-classification problem, whereas the formula written above seems to say otherwise.
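For reference, the formula I am referring to is (writing from memory, so my notation may differ from the slide) P(y = 1 | c, t) = σ(θ_tᵀ e_c), where e_c is the embedding of the context word c and θ_t is the parameter vector for the candidate word t. Taken on its own, this looks to me like one logistic regression, i.e., a single binary classifier.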
So which is correct: the formula, or the neural network representation posted below, which does not make sense to me?
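To make my question concrete, this is roughly how I would code the network as I read the diagram; the dimensions, k, and variable names below are my own toy choices, not from the lecture. If I understand the drawing, every word in the vocabulary gets its own sigmoid output (hence “multiple binary classifiers”), but only the one positive and k negative words are evaluated on each training example.

```python
import numpy as np

vocab_size, emb_dim, k = 10000, 300, 4  # toy sizes, my own assumption

E = np.random.randn(vocab_size, emb_dim) * 0.01      # word embedding matrix
theta = np.random.randn(vocab_size, emb_dim) * 0.01  # one parameter vector per sigmoid output

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward(context_idx, word_indices):
    """P(y = 1 | context, word) for the k + 1 sampled words only."""
    e_c = E[context_idx]                       # embedding of the context word
    return sigmoid(theta[word_indices] @ e_c)  # k + 1 independent sigmoid outputs

# Example: one positive word index plus k negative word indices.
print(forward(0, [1, 2, 3, 4, 5]))
```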
