In the Neural Machine Translation (NMT) model that translates human-readable dates ("25th of June, 2009") into machine-readable dates ("2009-06-25"), the post-attention LSTM at time t does not take the previous time step's prediction y⟨t−1⟩ as input.
If we need an attention model in which the post-attention LSTM layer takes both y(t-1) (the previous output) and the context vector at that time step as inputs (for example, in general machine translation), can we just concatenate y(t-1) with the context vector, or can we pass two separate tensors to the LSTM layer through its call argument `inputs`?
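Something like the sketch below is what I have in mind. As far as I know, a Keras LSTM layer's call takes a single `inputs` tensor (plus `initial_state`), so concatenating y(t-1) with the context vector along the feature axis seems to be the straightforward route. All names and dimensions here (Tx, Ty, n_a, n_s, human_vocab, machine_vocab, one_step_attention) are placeholders I made up, not the assignment's reference code:

```python
from tensorflow.keras.layers import (Input, LSTM, Bidirectional, Dense, Dot,
                                     Concatenate, RepeatVector, Reshape, Softmax)
from tensorflow.keras.models import Model

Tx, Ty = 30, 10                        # input / output sequence lengths (placeholders)
n_a, n_s = 32, 64                      # encoder Bi-LSTM units, post-attention LSTM units
human_vocab, machine_vocab = 37, 11    # hypothetical vocabulary sizes

# --- attention block (layers shared across output time steps) ---
repeator = RepeatVector(Tx)
concat_attn = Concatenate(axis=-1)
densor1 = Dense(10, activation="tanh")
densor2 = Dense(1, activation="relu")
attn_softmax = Softmax(axis=1)         # normalize attention weights over the Tx axis
dotor = Dot(axes=1)

def one_step_attention(a, s_prev):
    """Return a context vector (batch, 1, 2*n_a) from encoder activations `a`
    and the previous post-attention LSTM hidden state `s_prev`."""
    s_rep = repeator(s_prev)                       # (batch, Tx, n_s)
    concat = concat_attn([a, s_rep])               # (batch, Tx, 2*n_a + n_s)
    energies = densor2(densor1(concat))            # (batch, Tx, 1)
    alphas = attn_softmax(energies)                # attention weights
    return dotor([alphas, a])                      # (batch, 1, 2*n_a)

# --- decoder layers shared across time steps ---
post_lstm = LSTM(n_s, return_state=True)
output_layer = Dense(machine_vocab, activation="softmax")
concat_in = Concatenate(axis=-1)
reshaper = Reshape((1, machine_vocab))

# --- model ---
X = Input(shape=(Tx, human_vocab))                 # source sequence (one-hot)
s0 = Input(shape=(n_s,), name="s0")                # initial hidden state
c0 = Input(shape=(n_s,), name="c0")                # initial cell state
y0 = Input(shape=(1, machine_vocab), name="y0")    # start-of-sequence token
s, c, y_prev = s0, c0, y0

a = Bidirectional(LSTM(n_a, return_sequences=True))(X)

outputs = []
for t in range(Ty):
    context = one_step_attention(a, s)             # (batch, 1, 2*n_a)
    lstm_in = concat_in([context, y_prev])         # one merged input tensor
    s, _, c = post_lstm(lstm_in, initial_state=[s, c])
    out = output_layer(s)                          # (batch, machine_vocab)
    outputs.append(out)
    # feed this step's soft prediction back as y(t-1); during training one
    # would usually teacher-force the ground-truth token here instead
    y_prev = reshaper(out)

model = Model(inputs=[X, s0, c0, y0], outputs=outputs)
```

If the LSTM really had to receive two separate tensors per step rather than one concatenated tensor, that would mean writing a custom recurrent cell instead of using the built-in LSTM layer's single `inputs` argument, as far as I can tell.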
Were you able to find an answer to your question?