Module2 Lab: Retrieval metrics: Recall: false negatives calculation unclear

p.shar · July 24, 2025, 12:14pm

Hi.

Module2 Lab: Retrieval metrics:
The false negatives calculation in the given code works out to
true_positives - true_positives, which will always be 0. Therefore, the Recall will always be 1.0

What am I missing?

I am referring to the following code snippet in the jupyter notebook:

false_negatives = sum(
             newsgroups_train.target_names[df.iloc[idx]["category"]] == desired_category 
             for idx in top_results
         ) - true_positives

Thanks in advance for your help.

janhendrik · July 26, 2025, 8:48am

I believe this is an error in the code. To calculate false negatives, we need to find documents that have the correct category, but are not included in the top k. So the expression should not sum over top_results, in my opinion, but over all documents, and then subtract the true positives.

p.shar · July 26, 2025, 12:33pm

Thank you! That clears it up.

jsun0326 · August 12, 2025, 4:53am

I wish I saw your post earlier. I just posted about the same issue one minute ago. I think there is an error in the way how false negatives were computed in the provided lab. Happy that I am not the only that noticed the issue. Link to my post: Computer false negatives in C1M2_ungraded_Lab_2

p.shar · August 12, 2025, 6:03am

I hear you. @janhendrik and I believe that the false negatives calculation (and, therefore, recall) in the provided lab is not how they are calculated in the practical world. Perhaps the lab is using an assumption for some reason that might have to do with the top_k parameter in the sense that if top_k is a larger number then there would be fewer false negatives. But this line of thinking is… impractical?

I guess we can hope that one of the mentors will clear this up sooner rather than later. Till then,

false negatives = all desired_category docs in dataframe - true positives

TMosh · August 12, 2025, 6:18am

@Community-Team, who is the lead technologist for the RAG course?

Community-Team · August 12, 2025, 3:52pm

Hello @TMosh ,

@lucas.coutinho is OOO. I will reach out to the team.

lucas.coutinho · August 19, 2025, 1:04pm

Hi!

Yes, there is an error in the code - in fact it should be Precision@K and Recall@K instead of just precision and recall. We’re working on updating the ungraded lab with this new version.

Thanks,

Lucas

Topic		Replies	Views
Computer false negatives in C1M2_ungraded_Lab_2 Retrieval Augmented Generation week-module-2 , dl-ai-learning-platform	1	19	August 12, 2025
Week3 precision and recall Advanced Learning Algorithms week-module-3	1	20	June 21, 2025
Question on the Modelling Challenge quiz Machine Learning in Production	5	625	June 27, 2021
Logical Error in C2W2_Assignment - Breast Cancer Prediction Custom and Distributed Training with TF week-module-2	5	35	September 1, 2024
#Week3 - Skewed datasets - prevision/recall metrics Advanced Learning Algorithms week-module-3	2	247	February 20, 2024

Module2 Lab: Retrieval metrics: Recall: false negatives calculation unclear

Related topics