When calling TensorFlow LSTM layer instance, why retrieve hidden state and cell state from first and third element of returns？

Ron1 · September 23, 2021, 12:25pm

In assignment C5W1A3, the “Step 2: Loop through time steps” section says that using following formatting to retrieve hidden state and cell state.

next_hidden_state, _, next_cell_state = LSTM_cell(inputs=input_x, initial_state=[previous_hidden_state, previous_cell_state])

I read the documentation of TensorFlow LSTM layer in tf.keras.layers.LSTM | TensorFlow Core v2.6.0. It provide an example, when setting “return_state=True”, they use following code to retrieve return values.

whole_seq_output, final_memory_state, final_carry_state = lstm(inputs)

I could not find the description of thees three return values. So I guess that the first return value means prediction y in lecture, the second return value means hidden state, and the third return value means cell state.
But this is different from the usage in assignment, where the first return value means hidden state and the third return value means cell state.

I am confused about these two usages. Which one is correct?

Thank you.

paulinpaloalto · September 23, 2021, 6:27pm

The TF LSTM API has a zillion complicated options. You can see in the example you quote that they are using return_sequences = True, which is not the default. In our case, we are using the default for that option, but setting return_state = True. They do give at least a modicum of explanation or description in the “options” section of the relevant docpage. Worth a more detailed look …

Ron1 · September 24, 2021, 4:10am

I misunderstand the “output” in the TensorFlow LSTM documentation as the prediction y hat in the lecture.

Thanks for helping.

Topic		Replies	Views
Explanation of LSTM_cell call Sequence Models coursera-platform	6	679	May 3, 2023
C5_W3_A1 Setting "return_sequence" or "return_state" Sequence Models coursera-platform	1	255	December 13, 2023
Return values from calling LSTM_cell in Jazz assignment Sequence Models coursera-platform	1	521	March 13, 2022
LSTM unit outputs Sequence Models coursera-platform	3	548	February 15, 2023
Emojify_V2 LSTM return_sequences Argument Sequence Models week-module-2 , coursera-platform	4	132	October 21, 2024

When calling TensorFlow LSTM layer instance, why retrieve hidden state and cell state from first and third element of returns？

Related topics