What should be denominator in computing accuracy?

Victor_Luu · November 1, 2022, 7:14am

I understand about using mask to get rid of pad tokens and keep only actual predictions. So the number of correct predictions should be np.sum(outputs * mask == labels * mask). But what should be the total number of predictions then? I tried np.sum(mask), but got a wrong accuracy, larger than 100%.
Printing out my number of predictions shows that it is smaller than the number of correct predictions, so using np.sum(mask) should be wrong. But why? What should be the correct answer then?

My code, FYI:

    mask = (labels != pad)
    n_correct = np.sum(outputs * mask == labels * mask)
    n_prediction = np.sum(mask) 
    print("no. of correct predictions:", n_correct)
    print("total actual predictions:", n_prediction)
    accuracy = n_correct / n_prediction

paulinpaloalto · November 1, 2022, 7:00pm

The problem is not with the denominator: it is with the numerator. Notice that the way you wrote that, you get a True value for every position in which mask is 0. After all 0 == 0 is true, right?

Victor_Luu · November 2, 2022, 4:43pm

Oh, I got it. Thanks for the explanation. So I guess I need something like: sum(outputs == labels) where mask == 1.

Victor_Luu · November 2, 2022, 4:53pm

Yeah, I got it right this time, thanks @paulinpaloalto .

Topic		Replies	Views
Exercise 4: compute accuracy NLP with Sequence Models week-module-3	6	640	May 18, 2022
C3 Assignment 3 E4 Problem with understanding evaluate_prediction NLP with Sequence Models week-module-3	9	659	November 8, 2023
C3 Week3 Assignment Exercise 4 NLP with Sequence Models week-module-3	3	639	July 28, 2022
General question - accuracy function Neural Networks and Deep Learning coursera-platform	4	532	January 9, 2022
W3_A1_Ex-5.2_Understanding math behind accuracy calculation Neural Networks and Deep Learning coursera-platform	2	408	July 21, 2023

What should be denominator in computing accuracy?

Related topics