Padded sequence differs from the expected output

Basira_Daqiq · October 25, 2023, 10:43pm

My code outputs a slightly different sequence than the expected output. I am not sure what the problem could be. My output:

The first padded sequence looks like this: [96 1 1 … 0 0 0]

The numpy array of all sequences has a shape: (2225, 2438)

This means there are 2225 sequences in total and each one has a size of 2438

Expected Output:

First padded sequence looks like this: 

[  96  176 1157 ...    0    0    0]

Numpy array of all sequences has shape: (2225, 2438)

This means there are 2225 sequences in total and each one has a size of 2438

Basira_Daqiq · October 25, 2023, 10:53pm

Fixed it. I had passed num_words=100 to the tokenizer, which limits the encoding to the most frequent words, so the rarer ones came out as 1. Removed it and now it matches.
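For anyone hitting the same mismatch, here is a rough sketch of why a num_words cap produces runs of 1s. It is a simplified pure-Python stand-in, not the Keras code, and it assumes the tokenizer was created with an oov_token (which Keras places at index 1). The helper names `build_word_index` and `texts_to_sequences` and the toy sentences are made up for illustration.

```python
from collections import Counter

OOV_INDEX = 1  # assumption: an oov_token is set, so index 1 is reserved for it


def build_word_index(texts):
    """Rank words by frequency, most frequent first, starting at index 2."""
    counts = Counter(word for text in texts for word in text.lower().split())
    ranked = counts.most_common()  # ties keep first-seen order
    return {word: rank for rank, (word, _) in enumerate(ranked, start=2)}


def texts_to_sequences(texts, word_index, num_words=None):
    """Encode each text; any index >= num_words collapses to OOV_INDEX."""
    sequences = []
    for text in texts:
        seq = []
        for word in text.lower().split():
            idx = word_index.get(word, OOV_INDEX)
            if num_words is not None and idx >= num_words:
                idx = OOV_INDEX  # word is too rare to survive the cap
            seq.append(idx)
        sequences.append(seq)
    return sequences


# Toy corpus, purely illustrative
texts = ["the cat sat on the mat", "the dog sat on the log"]
word_index = build_word_index(texts)

full = texts_to_sequences(texts, word_index)                 # no cap
capped = texts_to_sequences(texts, word_index, num_words=4)  # small cap

print("full:  ", full[0])
print("capped:", capped[0])
```

In the capped run, every word ranked at or beyond the cap is replaced by 1, which matches the pattern above: 96 is below 100 and survives, while 176 and 1157 do not. Note that the full word index is still built either way; the cap only applies when texts are converted to sequences.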
