Hi @vrathi
You should not get the len
of ref_counter, but the length of the “system” (the model translation). Note that length of the counter would be smaller if there are duplicate words in a translated sentence.
Similarly, for the recall you should check against the length of the “reference” (the true translation). Also, the same “note” applies as in precision denominator.
Cheers