Hello, I have a couple of questions on Assignments 2 and 3 in Week 1 of Sequence Models.
- Why does Assignment 2 (Dinosaur name generation) use a basic RNN cell, whereas Assignment 3 (Music Generation) uses an LSTM cell?
- Dinosaur name generation makes sense to me, since it uses x<t> to predict y<t> = x<t+1>. However, I am confused by music generation. I expected the same setup, but the notebook says that Y is the same as X but shifted one step to the left (which I read as the past). Why would we use x<t> to predict x<t-1>?
- I also don’t understand how the model works in the Music Generation assignment. As I understand it, djmodel runs all of the time steps in X to generate a list of outputs (I believe this model’s parameters are not updated), and these outputs are then used in another model, which is the one that is trained. I have a few points of confusion here:
  - Why do we have a second model here instead of just training djmodel?
  - What exactly are the outputs generated by djmodel that are used in the second model, the one that is actually trained? I’m not sure how the second model learns anything useful.
  - How does the second model know the model architecture if it is given only the inputs and outputs? (I think this is more of a confusion about how the Functional API works.)
I would really appreciate any help here. Thank you very much!
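
P.S. To make my second question concrete, here is a toy sketch of how I am picturing a "shift one to the left". The list `X`, the name `pairs`, and the values are made up for illustration and are not taken from the notebook, so I may well be misreading what the notebook actually does.

```python
# Toy sequence standing in for one training example (values are made up,
# not the notebook's data).
X = ["a", "b", "c", "d", "e"]

# "Shifted one to the left": drop the first element, so every remaining
# element moves one position toward index 0.
Y = X[1:]

# Pair each input step with the label that sits at the same index.
# zip stops at the shorter list, so the last input has no label here.
pairs = list(zip(X, Y))

for t, (x_t, y_t) in enumerate(pairs):
    print(f"t={t}: input {x_t} -> label {y_t}")
```

In this sketch the label at index t is the input at index t+1, which looks like the same setup as the dinosaur assignment, so perhaps I am misunderstanding what "shifted to the left" refers to.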