Data Generator Logic

Pirzada · June 25, 2023, 10:50am

I am doing the Week 1 assignment and found that the data generator picks each tweet this way:

tweet = data_pos[pos_index_lines[pos_index]]

My question is: why can't we build the data generator this way instead?

data_pos[i : i+batch_size]

What is the logic behind this?

arvyzukai · June 26, 2023, 6:15am

There are usually many ways to write code that achieves the same result. I guess the course creators thought the suggested approach would be more understandable (natural) for learners who are not very familiar with vectorized code and are more comfortable with loops (“while”, “for”).
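For reference, the loop-based pattern looks roughly like the sketch below. This is not the assignment's actual code: `data_pos`, `pos_index_lines` and `pos_index` are the names from the question, while the function name `index_batches`, the `shuffle` and `seed` parameters, and the wrap-around behaviour are my assumptions for illustration.

```python
import random


def index_batches(data_pos, batch_size, shuffle=False, seed=0):
    """Yield batches forever, picking one item at a time through an index list.

    Hypothetical helper for illustration; not the course's implementation.
    """
    # The index list adds a level of indirection: the order of visits can be
    # changed (e.g. shuffled) without touching data_pos itself.
    pos_index_lines = list(range(len(data_pos)))
    rng = random.Random(seed)
    if shuffle:
        rng.shuffle(pos_index_lines)

    pos_index = 0
    while True:
        batch = []
        for _ in range(batch_size):
            if pos_index >= len(data_pos):
                # Ran out of data: start again from the first index.
                pos_index = 0
                if shuffle:
                    rng.shuffle(pos_index_lines)
            tweet = data_pos[pos_index_lines[pos_index]]
            batch.append(tweet)
            pos_index += 1
        yield batch
```

Because the end-of-data check happens per item, a batch that straddles the end of the data simply continues from the start, with no separate handling for the last batch.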

That said, slicing runs into problems when i is near the end of the array (when i + batch_size goes past the end), and you would have to handle that case. You would also have to convert each tweet to a tensor anyway.
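To make the end-of-array issue concrete, here is a minimal sketch of a slice-based generator. The function name `slice_batches` is hypothetical, and I am assuming `data_pos` is a plain Python list and that a short final batch should be topped up from the start of the data; other policies (dropping or padding the last batch) would be equally valid.

```python
def slice_batches(data_pos, batch_size):
    """Yield fixed-size batches forever using slicing, wrapping at the end.

    Hypothetical helper for illustration; assumes data_pos is a list.
    """
    if not 0 < batch_size <= len(data_pos):
        raise ValueError("batch_size must be between 1 and len(data_pos)")

    i = 0
    while True:
        # Slicing past the end does not raise an error in Python;
        # it silently returns fewer than batch_size items.
        batch = data_pos[i:i + batch_size]
        missing = batch_size - len(batch)
        if missing:
            # Top up the short batch from the start of the data.
            batch = batch + data_pos[:missing]
            i = missing
        else:
            i += batch_size
        yield batch
```

With five items and a batch size of 2, the third batch is built from the last item plus the first one. Each batch would still need to be converted to a tensor before training, as noted above.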

So, all in all, I think they simply chose the more conventional way.
