NLP Course 3 Week 3: Confusion matrix

Taisiya_Kopytova · July 8, 2024, 11:16am

Hi!

So I have a question about Exercise 4. I get the right values for the confusion matrix but some of the are at the wrong place. Same for the half of the examples in the unit test. This is what I get:

Accuracy 0.7259765625 Confusion matrix: [[1506 4876] [1300 2558]]

Expected Result

Accuracy ~0.725

Confusion matrix:

[[4876 1506]
[1300 2558]]

And

Wrong confusion matrix for threshold=0.7 and batch_size=512
Expected:[[1624 525]
[ 473 878]],
Got:[[ 525 1624]
[ 473 878]].

Wrong confusion matrix for threshold=0.75 and batch_size=512
Expected:[[1747 421]
[ 612 720]],
Got:[[ 421 1747]
[ 612 720]].

Wrong confusion matrix for threshold=0.7 and batch_size=256
Expected:[[1647 521]
[ 468 864]],
Got:[[ 521 1647]
[ 468 864]].

Wrong confusion matrix for threshold=0.8 and batch_size=256
Expected:[[1857 311]
[ 802 530]],
Got:[[ 311 1857]
[ 802 530]].

4 tests passed
4 tests failed

What could be a reason for that?
Thanks!

Alireza_Saei · July 8, 2024, 11:38am

Hi @Taisiya_Kopytova

Make sure that you correctly calculate and assign the values of TP, FP, TN, and FN as below:

[[TN, FP],
 [FN, TP]]

Hope it helps! Feel free to ask if you need further assistance.

Taisiya_Kopytova · July 8, 2024, 11:43am

Hi @Alireza_Saei

Thank you for replying!

I use tf.math.confusion_matrix, so I can’t control the calculation of TP, FP, TN, and FN.

Alireza_Saei · July 8, 2024, 11:53am

Your code snippet looks correct. However, make sure threshold is set appropriately for your problem. It should reflect the threshold above which you classify a sample as positive. Also, double-check that y_test and y_pred are of the correct shape and type. They should be compatible with the operations you are performing, especially with tf.math.confusion_matrix.

If these conditions are met and the rest of your implementation is correct.

Taisiya_Kopytova · July 8, 2024, 12:23pm

@Alireza_Saei I’m not sure what you mean by setting the threshold correctly. If it is correct in the snippet, what else could be a problem? And given that the accuracy value is calculated correctly, the threshold is not a problem.
The shape for y_test and y_pred in the first example is (10240,).

Would you mind to check that everything is calculated correctly from your side? It looks like that for the example and unit tests TP, FP, TN, and FN were calculated without the use of tf.math.confusion_matrix and I suspect that this is where the problem could lie.

Taisiya_Kopytova · July 8, 2024, 12:32pm

@Alireza_Saei I believe there might be a bag in unit tests and the grader. Check another similar post

Deepti_Prasad · July 8, 2024, 12:34pm

Send your codes via personal DM. Do not post codes on public post thread. kindly follow community guidelines.
@Taisiya_Kopytova

please remove any code snippet shared here using edit option

Taisiya_Kopytova · July 8, 2024, 12:35pm

@Deepti_Prasad Thank you for reminding me.

Deepti_Prasad · July 8, 2024, 12:39pm

Hi @Alireza_Saei

Kindly do not confirm codes are correct if you do not have access to the complete codes provided by learner, this can misguide learner as he ended responding on another post.

There is issue with his codes which does require review to his overall codes.

Regards
DP

Deepti_Prasad · July 8, 2024, 1:09pm

Hi @Taisiya_Kopytova

You somehow mixed up two code lines

Check if d>threshold to make predictions
Here you were only suppose to check if cosine similarity is greater than threshold but you ended up adding the accuracy part of the code to this code line. Also do not hard code the path by introducing res = d > threshold (I KNOW THE HINT FROM PREVIOUS CELLS TELLS YOU TO DO SO.
INSTEAD OF y_test == res, you only need to use tf.cast(d>threshold with the datatype).
Next your accuracy code is incorrect
take the average of correct predictions to get the accuracy
accuracy = tf.math.reduce_mean(y_pred)

You need to use tf.reduce_mean to the tf.cast when your actual target is absolute equal to the prediction with the datatype.

Regards
DP

Taisiya_Kopytova · July 8, 2024, 1:22pm

@Deepti_Prasad
Thank you very much for the explanation! It worked with the unit tests but the grader is complaining now

Deepti_Prasad · July 8, 2024, 1:38pm

This could be related to other code cells @Taisiya_Kopytova, so send your assignment for code review. Passing a unit test doesn’t confirm you always will pass the assignment. There could be other issue like using global variable instead of local variable. You can share the grader output here without sharing the codes.

Regards
DP

Taisiya_Kopytova · July 8, 2024, 1:39pm

@Deepti_Prasad I’ve restarted the kernel and rerun all cells and the problem was solved. Now the code passes the grader. So I think you’re right that it was some local/global variable issue

Alireza_Saei · July 8, 2024, 2:11pm

Hi @Deepti_Prasad

Sure, I will make sure to see all the code before guiding the learners and be more careful from now on! Thanks for letting me know.

Topic		Replies	Views
C3W3 Assignment Exercise 4 Evaluation/classify - Confusion matrix is wrong (not exactly matches to the expected output) NLP with Sequence Models week-3	16	832	January 1, 2025
C3_W3_Exercise 04_Incorrect accuracy and confusion matrix NLP with Sequence Models week-3	7	53	January 5, 2025
C3W3_Assignment Exercise 4 Invalid Confusion Matrix NLP with Sequence Models week-3	4	24	January 2, 2025
C3W3 Issues with Exercise 4: Classify NLP with Sequence Models week-3	4	14	February 16, 2025
C3W3 - Ex4 - classify - CM correct, accuracy far to low NLP with Sequence Models week-3	4	72	November 7, 2024

NLP Course 3 Week 3: Confusion matrix

Expected Result

Related topics