C5-W1-A3: Idiomatic usage of tf.keras.layers.LSTM vs LSTMCell

Jakob_Ullmann · October 7, 2022, 5:41pm

I was just wondering whether the code given for the Jazz improvisation task is actually idiomatic usage of TensorFlow/Keras. There we loop over t in range(Tx) and apply LSTM_cell over and over. But LSTM_cell is really a tf.keras.layers.LSTM object rather than a tf.keras.layers.LSTMCell. My understanding is that LSTM is supposed to take a whole sequence as input and, if return_sequences is set to True, return the entire sequence of outputs; internally, LSTM holds an LSTMCell object that it uses for the individual steps. So it seems like this is actually an abuse of the LSTM class.
Can someone confirm whether this is true, or is there some specific reason why the code was set up the way it is?
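
To make the comparison concrete, here is a minimal sketch (not the assignment code) of three ways to run the same weights over a sequence: one call on the whole sequence, a loop over length-1 slices with the state passed through initial_state, and a loop over the wrapped cell exposed as lstm.cell. The sizes and variable names are made up for illustration, and the snippet assumes eager execution in TensorFlow 2.x.

```python
import numpy as np
import tensorflow as tf

tf.random.set_seed(0)
batch, Tx, n_values, n_a = 2, 5, 3, 4  # hypothetical sizes, not the assignment's
x = tf.random.normal((batch, Tx, n_values))

# One LSTM layer; return_state=True makes it return (output, hidden, cell).
lstm = tf.keras.layers.LSTM(n_a, return_state=True)

# (1) Typical usage: pass the whole sequence in a single call.
out_full, _, _ = lstm(x)

# (2) Loop style: feed one time step at a time (sequence length 1)
#     and carry the state forward explicitly.
a = tf.zeros((batch, n_a))
c = tf.zeros((batch, n_a))
for t in range(Tx):
    out_step, a, c = lstm(x[:, t:t + 1, :], initial_state=[a, c])

# (3) Use the wrapped cell directly; it takes a 2-D input for one step.
h = tf.zeros((batch, n_a))
c2 = tf.zeros((batch, n_a))
for t in range(Tx):
    out_cell, (h, c2) = lstm.cell(x[:, t, :], [h, c2])

# All three should agree on the final hidden state up to float error.
print(np.allclose(out_full.numpy(), out_step.numpy(), atol=1e-5))
print(np.allclose(out_full.numpy(), out_cell.numpy(), atol=1e-5))
```

If the three results match, the loop over an LSTM layer is not computing anything different; it is just a more roundabout way of stepping the cell. The explicit loop mainly matters when each step's input depends on the previous step's output, which a single call on a fixed input sequence cannot express.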

TMosh · October 8, 2022, 3:09am

I’m wondering if I understand your question completely.

Does this thread help?

Luis.BR · January 10, 2024, 11:14am

The problem statement says: “The weights and biases are transferred to the new model using the global shared layers (LSTM_cell, densor, reshaper) described below”

LSTM_cell is a global variable, so the second function (for inference) works with the same LSTM_cell object that was trained in the previous function, ‘djmodel’. That LSTM_cell object holds the weights and biases.
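
A small sketch of that sharing mechanism, under assumptions: the layer names LSTM_cell and densor follow the assignment, but build_model, the sizes, and the simplified single-call architecture are hypothetical and only meant to show that two Keras models built from the same layer objects see the same weights.

```python
import numpy as np
import tensorflow as tf

n_a, n_values = 4, 3  # hypothetical sizes

# Layers created once at module level, then reused by every model.
LSTM_cell = tf.keras.layers.LSTM(n_a, return_state=True)
densor = tf.keras.layers.Dense(n_values, activation="softmax")

def build_model(Tx):
    """Hypothetical helper: a tiny model that calls the shared layers."""
    X = tf.keras.Input(shape=(Tx, n_values))
    a0 = tf.keras.Input(shape=(n_a,))
    c0 = tf.keras.Input(shape=(n_a,))
    _, a, _ = LSTM_cell(X, initial_state=[a0, c0])
    out = densor(a)
    return tf.keras.Model(inputs=[X, a0, c0], outputs=out)

train_like = build_model(Tx=5)  # stands in for the training model
infer_like = build_model(Tx=1)  # stands in for the inference model

# Both models hold the same layer objects, so their weights are identical.
same_weights = all(
    np.array_equal(w1, w2)
    for w1, w2 in zip(train_like.get_weights(), infer_like.get_weights())
)
print(same_weights)

# Changing the shared Dense layer is visible through either model:
# with all-zero weights, the softmax output becomes uniform.
densor.set_weights([np.zeros_like(w) for w in densor.get_weights()])
probs = infer_like(
    [tf.ones((1, 1, n_values)), tf.zeros((1, n_a)), tf.zeros((1, n_a))]
).numpy()
print(probs)
```

Nothing is copied between the two models here. Training one of them updates the variables owned by LSTM_cell and densor, and the other model reads those same variables the next time it runs.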
