Week1 building RNN step by step assignment - questions about input data dimension

Ethan · July 4, 2021, 10:44pm

Hello,

In this assignment, we set the dimension of the input data x to (n_x, m, T_x), with n_x being the size of the corpus, m being the batch size, and T_x being the timesteps.

Since T_x is a fixed number, do we assume that all input has the same number of timesteps? (e.x. if inputs are sentences, then all sentences will have the same length). If so, how do we perform forward propagation with different input lengths?

Thanks for answering in advance.

Ethan

TMosh · July 5, 2021, 4:43am

The exercise instructions tell you this:

Tx will denote the number of timesteps in the longest sequence.

Shorter sentences are padded to the correct length.

TMosh · July 5, 2021, 4:51am

Or, instead of padding shorter sentences, you can use a code value that indicates the end of the sentence.

Ethan · July 5, 2021, 4:05pm

I understand the idea of padding, but how does adding an indicator work? Assume that we add an encoded version of “\n” to the end of each sentence, they still have different lengths of timesteps and cannot be stored into a 3d array. I will appreciate it if you can give me a concrete example.

TMosh · July 5, 2021, 4:31pm

To use an end marker, you’d probably have to re-structure the code so the examples can be different lengths.

Ethan · July 5, 2021, 4:56pm

Does that mean vectorization is no longer applicable? Besides, I couldn’t find any related blog posts or papers that address this problem, could you please send me something that I can read about?

Thank you.

TMosh · July 5, 2021, 8:51pm

Sorry, I don’t have any references for this.

Ethan · July 6, 2021, 12:50pm

That’s okay, thank you for your time!

Topic		Replies	Views
C5W1A1sequences for training with different length Sequence Models coursera-platform	2	513	February 23, 2022
DLS 5 - Input/output of varying window sizes Sequence Models coursera-platform	7	533	June 8, 2022
RNN Shapes Clarification Sequence Models coursera-platform	2	522	July 4, 2022
RNNs with varying input text lengths Sequence Models week-module-1 , coursera-platform	6	179	March 15, 2024
RNN input doubt Sequence Models coursera-platform	8	443	June 2, 2023

Week1 building RNN step by step assignment - questions about input data dimension

Related topics