W1 A3 Jazz Music music_inference_model

Elemento · August 8, 2021, 11:50am

In this assignment, in 2.D, we have to convert the output from the previous LSTM cell into a new input for the current LSTM cell, and we are instructed to do this in 2 steps:

Get the index of the maximum value of the predicted output using tf.math.argmax along the last axis.
Convert the index into its n_values-one-hot encoding using tf.one_hot.
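For reference, the two steps above can be sketched in plain Python. This is only an illustration of what an argmax followed by a one-hot encoding does, not the notebook's TensorFlow code; the helper names `argmax` and `one_hot` and the example probabilities are made up for this sketch.

```python
def argmax(values):
    """Return the index of the largest entry (the first one on ties)."""
    return max(range(len(values)), key=lambda i: values[i])


def one_hot(index, depth):
    """Return a list of length `depth` with 1.0 at `index` and 0.0 elsewhere."""
    return [1.0 if i == index else 0.0 for i in range(depth)]


# Hypothetical softmax output over n_values = 5 possible values (illustrative numbers).
n_values = 5
prediction = [0.05, 0.10, 0.60, 0.20, 0.05]

index = argmax(prediction)             # step 1: index of the most probable value
next_input = one_hot(index, n_values)  # step 2: one-hot vector fed to the next cell
```

Because both steps are deterministic, the same prediction always yields the same next input.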

I am confused as to why we are instructed to take the index of the maximum value. If we do this, won’t we always generate the same music sequence? Why aren’t we choosing the index randomly, in proportion to the associated probabilities, as in the previous assignment (the dinosaur one)?
Am I missing something?

TMosh · August 13, 2021, 4:27am

That’s a good question, and I’ve wondered that myself.

I don’t know of any good reason. The notebook for the dinosaur names assignment says it uses random sampling “to make the results more interesting”.

Apparently, the author of the jazz music exercise was happy to have the code generate the same tune every time.

It would be interesting to use random sampling and see what happens.
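A rough sketch of what random sampling could look like, in plain Python rather than the notebook's TensorFlow code. The function name `sample_index`, the example probabilities, and the fixed seed are all assumptions made for illustration; nothing here is taken from either assignment.

```python
import random


def sample_index(probabilities, rng):
    """Draw one index at random, weighted by the given probabilities."""
    return rng.choices(range(len(probabilities)), weights=probabilities, k=1)[0]


def one_hot(index, depth):
    """Return a list of length `depth` with 1.0 at `index` and 0.0 elsewhere."""
    return [1.0 if i == index else 0.0 for i in range(depth)]


# Hypothetical softmax output over 5 possible values (illustrative numbers).
prediction = [0.05, 0.10, 0.60, 0.20, 0.05]

rng = random.Random(0)  # seeded only so that repeated runs of this sketch match
samples = [sample_index(prediction, rng) for _ in range(1000)]
counts = [samples.count(i) for i in range(len(prediction))]

# One sampled index turned into the next one-hot input.
next_input = one_hot(samples[0], len(prediction))
```

With argmax, index 2 would be chosen every time. With sampling, it is chosen most often but the other indices also appear, so repeated runs without a fixed seed would produce different sequences.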

Elemento · August 13, 2021, 11:18am

Thanks a lot @TMosh for your reply. I thought I was missing something in the Jazz notebook!
