Data mismatch with expected output in test for fit_tokenizer

Rajneesh_Kumar_Verma · July 16, 2022, 7:06pm

Can someone please check whether you are also getting a mismatch and, if so, what I am doing wrong here?

Output is:
Vocabulary contains 39432 words
<OOV> token NOT included in vocabulary

***Expected Output:***
Vocabulary contains 27285 words
<OOV> token included in vocabulary

Why is there a mismatch? Am I doing something wrong here?

balaji.ambresh · July 17, 2022, 5:25am

The expected output is correct. If you’re having trouble locating the mistake, please click my name and send me your notebook as an attachment in a message.

balaji.ambresh · July 17, 2022, 7:59am

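The second positional parameter of Tokenizer is filters, not oov_token, so a string passed in that position is treated as the set of characters to strip. Please use a named argument (oov_token=...) to set the OOV token.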
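
To illustrate why the positional call produces both symptoms, here is a minimal sketch. `MiniTokenizer` is a hypothetical, simplified stand-in written for this example; it only mimics the order of the leading parameters of the Keras `Tokenizer` (`num_words`, `filters`, ..., `oov_token`) and is not the library's implementation. The exact call in the original notebook is not shown in this thread, so the "wrong" call below is an assumption about its shape.

```python
# Default punctuation filter, mirroring the documented Keras default.
DEFAULT_FILTERS = '!"#$%&()*+,-./:;<=>?@[\\]^_`{|}~\t\n'


class MiniTokenizer:
    """Hypothetical stand-in: same leading parameter order as Keras Tokenizer."""

    def __init__(self, num_words=None, filters=DEFAULT_FILTERS,
                 lower=True, split=" ", oov_token=None):
        self.num_words = num_words
        self.filters = filters
        self.lower = lower
        self.split = split
        self.oov_token = oov_token
        self.word_index = {}

    def fit_on_texts(self, texts):
        counts = {}
        # Every character listed in `filters` is replaced by the separator.
        table = str.maketrans({ch: self.split for ch in self.filters})
        for text in texts:
            if self.lower:
                text = text.lower()
            for word in text.translate(table).split(self.split):
                if word:
                    counts[word] = counts.get(word, 0) + 1
        ordered = sorted(counts, key=counts.get, reverse=True)
        # The OOV token, when set, takes the first index.
        if self.oov_token is not None:
            ordered = [self.oov_token] + ordered
        self.word_index = {w: i + 1 for i, w in enumerate(ordered)}


sentences = ["Hello, world!", "Hello world"]

# Assumed shape of the mistake: "<OOV>" is the 2nd positional argument,
# so it overwrites `filters` and punctuation is no longer stripped.
wrong = MiniTokenizer(None, "<OOV>")
wrong.fit_on_texts(sentences)

# Fix: pass the token by name.
right = MiniTokenizer(oov_token="<OOV>")
right.fit_on_texts(sentences)

print(len(wrong.word_index), "<OOV>" in wrong.word_index)
print(len(right.word_index), "<OOV>" in right.word_index)
```

In this sketch the positional call yields a larger vocabulary ("hello," and "hello" count as separate words) with no OOV entry, which matches the pattern reported above. With the real class, the corresponding fix would be `Tokenizer(oov_token="<OOV>")`.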

Rajneesh_Kumar_Verma · July 17, 2022, 10:24am

That was a really silly one! Thanks for the help.
