When testing the fit_tokenizer function, I get:
Vocabulary contains 27284 words
token NOT included in vocabulary
Instead of the expected:
Vocabulary contains 27285 words
token included in vocabulary
Any ideas why this might be the case?
Set the oov_token when you create the Tokenizer. The OOV placeholder itself gets added to the word index, which accounts for the one extra word you're missing.
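A minimal sketch, assuming fit_tokenizer wraps Keras' Tokenizer from tensorflow.keras.preprocessing.text and that "<OOV>" is the placeholder string your exercise expects:

    from tensorflow.keras.preprocessing.text import Tokenizer

    def fit_tokenizer(sentences):
        # Passing oov_token adds a dedicated out-of-vocabulary entry to the
        # word index (at index 1), giving the expected vocabulary size.
        tokenizer = Tokenizer(oov_token="<OOV>")  # "<OOV>" is an assumed placeholder string
        tokenizer.fit_on_texts(sentences)
        return tokenizer

Without the oov_token argument, fit_on_texts only indexes words seen in the training sentences, so the vocabulary comes out one entry short and the OOV check fails.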
Great, that worked! Thanks!