My code passed the quick test but failed the unit test. I have spent several hours debugging without finding a solution. I need help with this issue, thanks.
See screenshot:
I figured out the solution.
Basically, when computing precision and recall, you have to use the “raw” counts of candidates and references, not just the “unique” counts, which are smaller whenever the candidates or the references contain duplicates.
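
For anyone hitting the same failure, here is a minimal sketch of the difference. It is not the assignment's actual code: the function name `precision_recall` and the sample token lists are made up for illustration, and it assumes the overlap is a clipped count (each token is matched at most as many times as it appears in both lists).

```python
from collections import Counter


def precision_recall(candidate, reference):
    """Hypothetical helper: precision and recall over token lists."""
    cand_counts = Counter(candidate)
    ref_counts = Counter(reference)

    # Clipped overlap: Counter & Counter keeps the minimum count per token.
    overlap = sum((cand_counts & ref_counts).values())

    # Divide by the raw lengths, duplicates included, not by len(set(...)).
    precision = overlap / len(candidate) if candidate else 0.0
    recall = overlap / len(reference) if reference else 0.0
    return precision, recall


# Made-up example data with duplicates on both sides.
candidate = ["the", "the", "cat", "sat"]
reference = ["the", "cat", "cat", "is", "here"]

p, r = precision_recall(candidate, reference)

# What you would get by (incorrectly) dividing by the unique counts.
overlap = sum((Counter(candidate) & Counter(reference)).values())
p_unique = overlap / len(set(candidate))
r_unique = overlap / len(set(reference))
```

With this sample data the raw-count version gives a precision of 2/4 and a recall of 2/5, while the unique-count version gives 2/3 and 2/4, so a test case containing duplicates would tell the two apart even if a quick test without duplicates passes.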