Assignment Exploring Overfitting in NLP - is it a binary or multiclassification problem?

bluetail · April 4, 2022, 3:04pm

After reading this, I have assumed it is a binary classification problem (negative vs positive reviews).

Parsing the raw data

The labels are originally encoded as strings (‘0’ representing negative and ‘4’ representing positive). You need to change this so that the labels are integers and 0 is used for representing negative, while 1 should represent positive.

So I have encoded 0 for negative and 1 for positive.

however, there could be the multi- labels, 0,1,2,3,4, and so we should be using cross-entropy loss.

Can a mentor reply which one is it please? binary or multi-class?

CSAlexiuk · April 4, 2022, 5:26pm

Hey bluetail!

Excellent question!

Because the labelled data is either 0 or 4 - and we encode those to 0 and 1 respectively, this is a binary classification problem.

As an exercise, you can check to verify that the only labels present in the dataset are “0” and “4”, which should help confirm that this is, indeed, binary classification!

Hopefully that helps

Have an awesome day!

bluetail · April 4, 2022, 7:17pm

thank you. do you also know how to get one of the target curves shown? that was my another question for this assignment, about jagged curves:

that said, I have passed the grader with my solution.

CSAlexiuk · April 4, 2022, 9:49pm

I will definitely take a look and see if I can help out with your other question!

Thanks,
Chris!

Topic		Replies	Views
Weighted loss pytorch AI Discussions	2	61	June 19, 2023
Week 3 Assignment - help with interpreting results Natural Language Processing in TensorFlow	2	338	December 22, 2022
Multi-class Y values Advanced Learning Algorithms week-2	5	520	July 14, 2022
TF C3W3 assignment results too good Natural Language Processing in TensorFlow week-3	7	76	December 15, 2023
C3W2 Assignment Labels Shape Natural Language Processing in TensorFlow week-2	7	34	March 10, 2025

Assignment Exploring Overfitting in NLP - is it a binary or multiclassification problem?

Parsing the raw data

Related topics