C3W3 seq_pad_and_trunk

Brendon_Wolff-Piggot · October 17, 2022, 8:49am

I’m getting an error in the function seq_pad_and_trunc that I can’t understand. My output is appended below. Please advise.

Thanks!

TypeError Traceback (most recent call last)
in
1 # Test your function
----> 2 train_pad_trunc_seq = seq_pad_and_trunc(train_sentences, tokenizer, PADDING, TRUNCATING, MAXLEN)
3 val_pad_trunc_seq = seq_pad_and_trunc(val_sentences, tokenizer, PADDING, TRUNCATING, MAXLEN)
4
5 print(f"Padded and truncated training sequences have shape: {train_pad_trunc_seq.shape}\n")

in seq_pad_and_trunc(sentences, tokenizer, padding, truncating, maxlen)
16
17 # Convert sentences to sequences
—> 18 sequences = tokenizer.texts_to_sequences(sentences)
19
20 # Pad the sequences using the correct padding, truncating and maxlen

/opt/conda/lib/python3.8/site-packages/keras_preprocessing/text.py in texts_to_sequences(self, texts)
279 A list of sequences.
280 “”"
→ 281 return list(self.texts_to_sequences_generator(texts))
282
283 def texts_to_sequences_generator(self, texts):

/opt/conda/lib/python3.8/site-packages/keras_preprocessing/text.py in texts_to_sequences_generator(self, texts)
315 i = self.word_index.get(w)
316 if i is not None:
→ 317 if num_words and i >= num_words:
318 if oov_token_index is not None:
319 vect.append(oov_token_index)

TypeError: ‘>=’ not supported between instances of ‘int’ and ‘tuple’

balaji.ambresh · October 17, 2022, 8:53am

Course 2 week 4 deals with an mnist dataset. Please see this community user guide and move your topic to the correct sub-category.

Brendon_Wolff-Piggot · October 17, 2022, 9:09am

Thanks, this the moving now done. I didn’t find what I needed in the community guidelines, but trial and error moving the post worked
Unfortunately the error remains…

balaji.ambresh · October 17, 2022, 9:13am

Please click my name and message your notebook as an attachment.

balaji.ambresh · October 17, 2022, 9:29am

The mistake is not in seq_pad_and_trunc but in fit_tokenizer function. The 1st parameter of Tokenizer#init is num_words. This should not be set to sentences. Please read the docs and fix the constructor call.

Topic		Replies	Views
C3W3 Assignment padding problem Advanced Computer Vision with TensorFlow week-module-3	4	564	May 15, 2023
C3W3 Assignment - seq_and_pad - error in dependency Natural Language Processing in TensorFlow week-module-3	2	317	January 17, 2024
Natural Language Processing in TensorFlow: Week 3: Exploring Overfitting in NLP Natural Language Processing in TensorFlow	9	612	September 1, 2022
C3W3 assignment seq_pad_and_trunc failing test Natural Language Processing in TensorFlow	5	377	April 25, 2022
C3W2_Assignment : Error on def seq_and_pad Natural Language Processing in TensorFlow week-module-2 , week-module-3 , week-module-4	6	676	May 19, 2022

C3W3 seq_pad_and_trunk

Related topics