Hi @Amira_Al-Samawi,
They are not contradictory.
Cross-validation compares metric performance across models with different sets of hyperparameters. However, we should exclude the "threshold" from that set of hyperparameters when the metric in question is precision or recall, because, for example, tuning the threshold down always increases recall.
You always achieve 100% recall when you set the threshold to 0, even if all of your other hyperparameters are complete nonsense. In other words, since reducing the threshold always increases recall, there is no point in using cross-validation to tune the other hyperparameters if the goal is simply the best recall.
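Here is a minimal sketch of that point, assuming synthetic data and a deliberately crippled logistic regression (the dataset, hyperparameter values, and thresholds are all made up for illustration): no matter how bad the model is, recall reaches 1.0 once the threshold hits 0.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import recall_score
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, weights=[0.9, 0.1], random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# "Nonsense" hyperparameter: extremely strong regularization cripples the model.
model = LogisticRegression(C=1e-6, max_iter=1000).fit(X_train, y_train)
proba = model.predict_proba(X_test)[:, 1]

for threshold in [0.5, 0.25, 0.0]:
    preds = (proba >= threshold).astype(int)       # lower threshold -> more predicted positives
    print(threshold, recall_score(y_test, preds))  # recall only goes up, reaching 1.0 at threshold 0
```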
We can divide metrics into three categories where the threshold is concerned:
- metrics that are monotonically increasing / decreasing with the threshold, e.g. precision / recall
- metrics that are independent of the threshold, e.g. AUC (google "AUC area under curve metric" for more)
- other metrics
For type 1, we exclude the threshold from cross-validation. For type 2, tuning the threshold has no effect at all. For type 3, we can include the threshold in cross-validation (see the sketch below).
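For type 3, here is a minimal sketch of what "including the threshold" could look like, assuming F1 as the type-3 metric and a small hand-written grid over one model hyperparameter (C) plus the threshold; everything here (data, grid values, metric choice) is illustrative, not a prescribed recipe:

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.model_selection import StratifiedKFold

X, y = make_classification(n_samples=1000, weights=[0.9, 0.1], random_state=0)
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)

best_params, best_score = None, -1.0
for C in [0.01, 1.0, 100.0]:                  # a model hyperparameter
    for threshold in [0.3, 0.5, 0.7]:         # the decision threshold, tuned alongside C
        scores = []
        for train_idx, val_idx in cv.split(X, y):
            model = LogisticRegression(C=C, max_iter=1000).fit(X[train_idx], y[train_idx])
            proba = model.predict_proba(X[val_idx])[:, 1]
            scores.append(f1_score(y[val_idx], (proba >= threshold).astype(int)))
        if np.mean(scores) > best_score:
            best_params, best_score = (C, threshold), np.mean(scores)

print("best (C, threshold):", best_params, "mean F1:", round(best_score, 3))
```

Because F1 is not monotonic in the threshold, the cross-validated scores can genuinely trade off across threshold values, which is exactly why it is safe to tune here but not for recall alone.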
Raymond