Choosing metric for a binary classification (sentiment analysis) problem? how to use Binary Accuracy?

bluetail · April 4, 2022, 12:11pm

Hello,

I have a text corpus of movie reviews which need to be classified as positive(1) or negative(0).

I have chosen loss=‘binary_crossentropy’ and thinking about a metric.

I thought about using the F1-score which is good for binary classification, but do not see it in the list of metrics in Tensorflow 2.
I’m looking at ‘Binary Accuracy’ which calculates “how often predictions match binary labels”.
which seems fit for my problem.

yet, there is a threshhold parameter.

I have a balanced dataset, i.e. the number of positive labels match the number of negative labels in both the training and the testing sets.

What value should I be using for the threshold? leave it as default or set 0.5?

Thank you!

gent.spah · April 5, 2022, 4:26pm

If you have a balanced dataset the F1 score is not really of much use, because its mainly used for unbalanced datasets. Now about the threshold ; i think you are using the sigmoid function for binary classification, you should understand how this works:

if the sentiment is negative and your model is doing its job well, it should drive the output to almost 0
if the sentiment is positive and your model is doing its job well, it should drive the output to almost 1

so basically depending on how good the model is, the separation between negative and positive prediction should be large (if the model is good). If you feel the model is good than even a lower threshold would still be effective, otherwise you increase the threshold because you are not confident in your model.

bluetail · April 5, 2022, 9:05pm

thank you for the reply. would it be OK to use the binary_accuracy metric then?

gent.spah · April 5, 2022, 9:30pm

Since you have only 2 labels why not Binary_crossentropy for loss and accuracy for the metrics?

bluetail · April 6, 2022, 8:13am

yes, Binary_crossentropy for loss, and binary_accuracy for the metric? I am wondering why they have both binary_accuracy and accuracy?
thank you.

gent.spah · April 6, 2022, 10:46am

I see in tensorflow the accuracy doesnt have a threshold but other than that they look the same. Ultimately you have to read through the documentation.

Topic		Replies	Views
Metrics argument in model.compile Introduction to TF for Artificial Intelligence ... week-2	2	364	August 17, 2023
Assignment Exploring Overfitting in NLP - is it a binary or multiclassification problem? Natural Language Processing in TensorFlow	3	522	April 4, 2022
Week 3 Assignment - help with interpreting results Natural Language Processing in TensorFlow	2	338	December 22, 2022
Evaluation metrics for imbalanced image classification AI Discussions ai-discussions	2	101	May 1, 2023
C4W2 Programming Assignment 2: Transfer Learning with MobileNet Convolutional Neural Networks week-2	3	323	January 7, 2024

Choosing metric for a binary classification (sentiment analysis) problem? how to use Binary Accuracy?

Related topics