I am following Course 5 and re-implemented both the RNN and the LSTM in Julia.
I ran the LSTM on the dinos.txt dataset (not currently included in the git repo code).
However, it gives me fairly low accuracy, hardly exceeding ~84%. I suspect this is because the network is trying to learn the padding, since I process my sequences in mini-batches.
I am a bit confused about how this is handled in the course. Is the model trained on a single sequence at a time? It does not look like it. So how are sequences of different lengths handled within a batch?
I tried using a loss that ignores the padding, but my results got even worse.
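To clarify what I mean by a padding-ignoring loss, here is a minimal sketch (simplified from my actual code; array shapes and names are just for illustration): cross-entropy where padded timesteps are zeroed out by a mask and the loss is averaged only over real tokens.

```julia
# ŷ: predicted probabilities, shape (vocab, T, batch)
# y: one-hot targets, same shape as ŷ
# mask: shape (T, batch); 1.0 for real timesteps, 0.0 for padding
function masked_crossentropy(ŷ, y, mask)
    ϵ = 1e-9  # avoid log(0)
    # per-timestep negative log-likelihood, summed over the vocab dimension
    nll = -dropdims(sum(y .* log.(ŷ .+ ϵ); dims=1); dims=1)  # (T, batch)
    # zero out padded positions, normalize by the number of real tokens
    return sum(nll .* mask) / max(sum(mask), 1)
end
```

With this, the padded positions contribute nothing to the gradient, yet my results got worse, which is what puzzles me.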
Thanks for any help.