Could anyone please clarify why the 2nd LSTM layer has return_sequences=False?
So what output are we expecting from this layer, and where is it passed?
Also, the model diagram shows the output of this layer going outside the network, which adds to the confusion. Some clarity here would be appreciated.
I have the same question, so I am reopening this thread as it wasn’t really answered.
What is the purpose of the return_sequences parameter? This is not explained well in the documentation.
Why do we set it to True in the first LSTM layer, and False in the second?
Related to this question (and possibly the answer): in a Week 1 assignment, we used a loop to build a recurrent model. Here we don't, but I assume that Keras somehow does it automatically.
A little more detailed explanation, and an answer to the original post, would be appreciated.
I was able to figure out the answer myself.
Since you have the same query, let me clarify it for you.
The assignment is a classification problem, since we want to map each input sentence to an emoji.
Let's say we keep the model simple, with only 1 LSTM layer. When the output of the embedding layer is passed as input to the LSTM, a single output vector is expected, hence the argument return_sequences=False.
In this assignment, the model is a bit more complex, with an additional LSTM layer. An LSTM layer consumes one input per timestep of the input sentence, so it expects a sequence as input rather than a single vector. Hence the first layer sets return_sequences=True, and its full sequence of outputs is passed as input to the 2nd LSTM layer. In the 2nd layer we set return_sequences=False, because we want a single output, i.e., the output from the last timestep of that LSTM layer.
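Here is a minimal sketch of this two-layer setup (the layer sizes, sentence length, and class count below are illustrative assumptions, not necessarily the assignment's exact values):

```python
from tensorflow.keras.layers import Input, Embedding, LSTM, Dense
from tensorflow.keras.models import Model

max_len = 10       # assumed max sentence length (timesteps)
vocab_size = 400   # assumed vocabulary size
emb_dim = 50       # assumed embedding dimension

inputs = Input(shape=(max_len,), dtype="int32")  # (batch, 10) word indices
x = Embedding(vocab_size, emb_dim)(inputs)       # (batch, 10, 50)
x = LSTM(128, return_sequences=True)(x)          # (batch, 10, 128): one vector per timestep
x = LSTM(128, return_sequences=False)(x)         # (batch, 128): last timestep only
outputs = Dense(5, activation="softmax")(x)      # (batch, 5): one emoji class per sentence
model = Model(inputs, outputs)
model.summary()  # shows the timestep dimension disappearing after the 2nd LSTM
```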
I hope this clarifies your doubt as well.
To my second question: how do activations get propagated across the LSTM cells in the sequence? (In a Week 1 assignment we had to do it programmatically, in a loop.)
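For context, this is roughly the kind of loop I mean: a simplified vanilla-RNN sketch, not the exact assignment code (the assignment used LSTM cells and batched inputs):

```python
import numpy as np

def rnn_forward(x, a0, Wax, Waa, ba):
    """x: (T_x, n_x) input sequence; a0: (n_a,) initial hidden state."""
    a = a0
    activations = []
    for t in range(x.shape[0]):                 # explicit loop over timesteps
        a = np.tanh(Wax @ x[t] + Waa @ a + ba)  # each step consumes the previous state
        activations.append(a)
    return np.stack(activations)                # (T_x, n_a): one activation per timestep
```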
I don’t think it has anything to do with the shape of the input. From the documentation:
“By default, the output of a RNN layer contains a single vector per sample… The shape of this output is (batch_size, units).”
“A RNN layer can also return the entire sequence of outputs for each sample (one vector per timestep per sample), if you set return_sequences=True. The shape of this output is (batch_size, timesteps, units).”
So the reason we set return_sequences=True is because we want to output a vector for each word (aka timestep) in each sentence sample, not a vector for each sentence sample by default.
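A quick shape check makes this concrete (the dimensions here are hypothetical, not the assignment's):

```python
import numpy as np
import tensorflow as tf

# (batch_size, timesteps, features) = (32, 10, 50)
x = np.random.rand(32, 10, 50).astype("float32")

seq_out = tf.keras.layers.LSTM(64, return_sequences=True)(x)
last_out = tf.keras.layers.LSTM(64, return_sequences=False)(x)

print(seq_out.shape)   # (32, 10, 64): one vector per timestep per sample
print(last_out.shape)  # (32, 64): one vector per sample (last timestep)
```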
And that raises the question: why do we need this form of output in the first LSTM but not the second? It would be great if anyone could share their thoughts on this.
[quote="realnoob, post:8, topic:73088"]
So the reason we set `return_sequences=True` is because we want to output a vector for each word (aka timestep) in each sentence sample, not a vector for each sentence sample by default.
[/quote]
Yes, you have rightly explained why we set return_sequences to True in the first place.