Can someone please check whether you are also getting a mismatch and, if so, tell me what I am doing wrong here?
My output is:
Vocabulary contains 39432 words
<OOV> token NOT included in vocabulary
***Expected Output:***
Vocabulary contains 27285 words
<OOV> token included in vocabulary
Why is there a mismatch? Am I doing something wrong here?