Picking the threshold point in the recall/precision curve

Amira_Al-Samawi · October 19, 2022, 2:23pm

Hello everyone!

I did not understand the following from the final video of week 3 regarding recall/ precision:

" Notice that picking the threshold is not something you can really do with cross-validation because it’s up to you to specify the best points. For many applications, manually picking the threshold to trade-off precision and recall will be what you end up doing"

Can anyone explain this to me, please?

alvaroramajo · October 19, 2022, 4:11pm

Hi, @Amira_Al-Samawi !

The precision / recall curve is a way of visualizing how is your model performing. A more restrictive threshold will only predict as positive the ones with the most confidence (higher precision) but will leave behind some true positives as well (lower recall), and vice versa

Amira_Al-Samawi · October 19, 2022, 8:20pm

@alvaroramajo Thank you for replying. I do understand what you wrote, but what I did not understand is the part in which cross-validation is mentioned. It is written that we cannot pick the point, but it is up to us to specify it. Is this sentence contradictory, or is there something I am missing?

rmwkwok · October 19, 2022, 11:48pm

Hi @Amira_Al-Samawi,

They are not contradictory.

Cross validation compares metric performance over models of different sets of hyperparameters. However, we should exclude “threshold” from such set of hyperparameters when the metric in question is precision or recall, because for example tuning the threshold down always increase recall.

You always achieve 100% recall when you set threshold equal to 0 even if your any other hyperparameters are completely non-sense. In other words, since reducing threshold always increase recall, nobody should care to tune other hyperparameters with the technique of cross-validation to achieve the goal of best recall.

We can divide metrics into 3 categories when threshold is concerned:

metrics that are monotonically increasing / decreasing with threshold, e.g. precision / recall
metrics that are independent of threshold, e.g. AUC (google “AUC Area under curve metric” for more)
other metrics

For type 1, we exclude threshold from cross validation. For type 2, tuning the threshold has no effect at all. For type 3, we can include threshold in cross validation.

Raymond

naveadjensen · November 12, 2022, 9:50pm

Hi @rmwkwok -
Your discussion here brings up another question that I’ve been thinking about when it comes to cross validation for NNs. It makes sense why you wouldn’t change threshold when looking at precision or recall. But what if you are looking at accuracy and error. Varying alpha and lambda seems to be pretty normal, but what other inputs could be varied? It makes sense that anything that would change the weights that are learned could be a potential input to change, so for example, the number of epochs, the number of neurons in each layer, and the number of layers. Are those three appropriate to change, or are they something that people don’t vary during cross validation usually?

rmwkwok · November 12, 2022, 10:02pm

Hi Navead,

We compare different settings of them in cross validation, and besides them, we can also compare the choice of activation function, the choices of input features (e.g. we can add polynomial features), the choice of how we initialize neural network’s weights, and so on. This list of tunable hyperparameters should cover anything related to the training data, and the neural network itself.

Cheers,
Raymond

Topic		Replies	Views
Optimize for Recall or Precision Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	402	October 5, 2023
Precision/Recall on which data? Advanced Learning Algorithms week-module-3	2	500	November 11, 2022
Balancing recall and precision Advanced Learning Algorithms week-module-3	5	367	September 4, 2023
Precision and Recall for different threshold or algorithm? Advanced Learning Algorithms week-module-3	1	514	July 16, 2022
Precision / Recall - Error metrics for skewed data Advanced Learning Algorithms week-module-3	5	507	August 11, 2022

Picking the threshold point in the recall/precision curve

Related topics