Error in video's ROUGE-1 `precision` calculation

At 6:32 in the video, when the reference sentence is It is cold outside.:

Take, for example, this generated output,
cold, cold, cold, cold.

As this generated output contains one of the words from the reference sentence, it will score quite highly even though the same word is repeated multiple times.[emphasis added]

The Rouge-1 precision score will be perfect.

The precision score calculation shown on the whiteboard in the video has 4 as the number of unigram matches, ignoring, as indicated by the transcript, the fact of repetition. But repetitions should not be counted; what should be counted are unique instances. The number of matches should thus be 1, and the precision score should be 1/4. See Google Research’s Python code at

google-research/rouge/rouge_scorer.py at master · google-research/google-research · GitHub

The number of matches, in pseudo-code, is:

for each unigram in reference:
   matches += min(reference instances, output instances)

If correct understand there should be:

ROUGE-1 Recall: 0.25 (25%)
Precision: 0.25 (25%)

Yes, the same correction should apply to Recall.