There are some points that I have not been able to develop understanding about, please help:-
- In loading the tensors, when the instruction says-
Notice that the data in
Y
is reordered to be dimension (ππ¦,π,90), where ππ¦=ππ₯. This format makes it more convenient to feed into the LSTM later.
I still dont understand why this is important and what doest it achieve?
- After the creating the model as such, for loss calculation the instruction states-
Youβll turn
Y
into a list, since the cost function expectsY
to be provided in this format.
list(Y)
is a list with 30 items, where each of the list items is of shape (60,90).
Where 60 is the batch size. So have we created a model that outputs sequences for 60 batches? How does this work at the time of inference?
I am thoroughly confused. Please help