It is a good question, but I don’t know if there is a definitive answer. It looks like they simply chose to use the L2 norm directly in the verify case. Here’s another recent thread on this topic.
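It works either way, as long as the threshold is chosen to match: squaring is monotonic for non-negative values, so comparing the squared distance against the squared threshold gives the same answer as comparing the norm against the threshold. I think using the square is slightly more computationally efficient because it skips the square root, so it makes more sense to me to use that. But they didn’t ask my opinion.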
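
To make the equivalence concrete, here is a minimal sketch in plain Python (not the course's code). The function names, the example vectors, and the threshold of 0.7 are all made up for illustration; the only point is that the two checks agree once the threshold is squared too.

```python
import math


def squared_l2_distance(a, b):
    """Sum of squared differences between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def l2_distance(a, b):
    """Plain L2 norm of the difference (adds one square root)."""
    return math.sqrt(squared_l2_distance(a, b))


def verify_with_norm(a, b, threshold):
    # Compare the L2 norm directly against the threshold.
    return l2_distance(a, b) < threshold


def verify_with_squared_norm(a, b, threshold):
    # Skip the square root and compare against the squared threshold.
    return squared_l2_distance(a, b) < threshold ** 2


# Hypothetical encodings and threshold, for illustration only.
anchor = [0.1, 0.4, 0.8]
close_match = [0.2, 0.1, 0.7]
far_match = [0.9, 0.9, 0.0]
threshold = 0.7

for candidate in (close_match, far_match):
    print(verify_with_norm(anchor, candidate, threshold),
          verify_with_squared_norm(anchor, candidate, threshold))
```

If you reuse a threshold that was tuned for the plain norm with the squared distance (or the other way around) without squaring it, the two versions will not agree, so that is the one thing to watch for.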