Why not evaluate models on test set?

Hi, @djdevilliers.

This is a recurrent question in this forum. As @gent.spah says, it can also overfit the dev set.
Check this post for more details if you want.

1 Like