# GRADED FUNCTION: tokenize_labels

The last function in the C3 W1 assignment assigns the OOV token to the first element of the vocab, even though I did not pass oov_token as an argument when initializing the Tokenizer. Any idea why?

Please click my name and message your notebook as an attachment.


In the function train_val_split, you should not assign a constant value like 1780 to train_size. Compute train_size from the function parameters: if training_split is 0.8, the training set should contain 80% of the rows and the validation set the remaining 20%.
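The advice above can be sketched as follows. This is a minimal illustration, assuming the notebook's train_val_split takes sentences, labels, and a training_split fraction and returns the four slices; the exact signature in the assignment may differ.

```python
def train_val_split(sentences, labels, training_split):
    """Split sentences/labels into train and validation sets by fraction."""
    # Derive the split index from the fraction, not a hard-coded number
    train_size = int(len(sentences) * training_split)

    train_sentences = sentences[:train_size]
    train_labels = labels[:train_size]
    validation_sentences = sentences[train_size:]
    validation_labels = labels[train_size:]

    return train_sentences, validation_sentences, train_labels, validation_labels
```

With 10 rows and training_split=0.8, this yields 8 training rows and 2 validation rows, whatever the dataset size is.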

No, sorry, I sent you the wrong file.
This is the correct notebook about OOV.

[code removed - moderator]

def tokenize_labels(labels) has a bug. There’s no need to use OOV when tokenizing labels.

But I didn’t pass any additional arguments in tokenize_labels, so why does the result come out like this?

You are calling fit_tokenizer to create a tokenizer for the labels. That function assigns an OOV token, and hence the problem.

I tried to delete it, but that didn’t seem to work.

You should create a tokenizer without oov. Please think about it. The dataset has labels for inputs. Why would you need an oov token for the labels?
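The point above can be sketched as follows. This is an illustrative version of tokenize_labels, not the assignment's reference solution: it builds a fresh Keras Tokenizer with no oov_token instead of reusing the notebook's fit_tokenizer helper, so index 1 goes to an actual label rather than to an OOV placeholder.

```python
from tensorflow.keras.preprocessing.text import Tokenizer


def tokenize_labels(labels):
    """Tokenize dataset labels without an OOV token.

    Every label that can appear is already in the training data,
    so there is nothing "out of vocabulary" to reserve an index for.
    """
    label_tokenizer = Tokenizer()  # note: no oov_token argument
    label_tokenizer.fit_on_texts(labels)
    label_sequences = label_tokenizer.texts_to_sequences(labels)
    return label_sequences, label_tokenizer.word_index
```

With labels like ["sport", "tech", "sport"], the word index maps "sport" to 1 and "tech" to 2, and no `<OOV>` entry appears, which is the behavior the grader expects when no oov_token is set.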

What happened? Let me see too.