Attention sequence model, Week 3 (make post-attention step's output depend on the prior step)

Hi, I finished the Week 3 attention sequence model assignment, where we built an attention model that translates human-written dates into mm-dd-yyyy format. In this model, the post-attention LSTM step does not take input from the prior step, which makes sense for this task. I have two questions:

  1. What code changes are required to give the post-attention LSTM cells an additional input beyond the hidden state and context? I tried this but was struggling a little, so I would love some pointers; I will also keep trying on my own.
  2. If we trained this new architecture, I assume the model would just learn to put very little weight on the prior output and still work. Thoughts?
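For question 1, here is one way the change could look, assuming the Keras layer structure from the assignment (shared attention layers, a `post_activation_LSTM_cell`-style decoder loop): add a `y0` "previous output" input, lift each step's softmax output from `(m, out_vocab)` to `(m, 1, out_vocab)`, and concatenate it with the context vector before the post-attention LSTM. Names like `post_lstm` and `expand`, and the exact dimensions, are my own placeholders, not the assignment's; this is a minimal sketch, not the official solution.

```python
import numpy as np
from tensorflow.keras.layers import (Input, Bidirectional, LSTM, Dense,
                                     Concatenate, Dot, Softmax, RepeatVector)
from tensorflow.keras.models import Model

Tx, Ty = 30, 10              # input / output sequence lengths (assumed defaults)
n_a, n_s = 32, 64            # pre- / post-attention LSTM sizes (assumed defaults)
in_vocab, out_vocab = 37, 11 # assumed vocabulary sizes

# shared attention layers, in the style of the assignment
repeator = RepeatVector(Tx)
concatenator = Concatenate(axis=-1)
densor1 = Dense(10, activation="tanh")
densor2 = Dense(1, activation="relu")
activator = Softmax(axis=1)          # softmax over the time axis
dotor = Dot(axes=1)

post_lstm = LSTM(n_s, return_state=True)
output_layer = Dense(out_vocab, activation="softmax")
expand = RepeatVector(1)             # lifts (m, out_vocab) to (m, 1, out_vocab)

def one_step_attention(a, s_prev):
    s_prev = repeator(s_prev)
    concat = concatenator([a, s_prev])
    energies = densor2(densor1(concat))
    alphas = activator(energies)
    return dotor([alphas, a])        # context vector, shape (m, 1, 2*n_a)

def build_model():
    X = Input(shape=(Tx, in_vocab))
    s0 = Input(shape=(n_s,))
    c0 = Input(shape=(n_s,))
    y0 = Input(shape=(out_vocab,))   # NEW input: the "previous output" for step 0
    s, c, y_prev = s0, c0, y0

    a = Bidirectional(LSTM(n_a, return_sequences=True))(X)

    outputs = []
    for _ in range(Ty):
        context = one_step_attention(a, s)
        # NEW: feed the prior step's prediction alongside the context vector
        lstm_in = Concatenate(axis=-1)([context, expand(y_prev)])
        s, _, c = post_lstm(lstm_in, initial_state=[s, c])
        out = output_layer(s)
        outputs.append(out)
        y_prev = out                 # this step's output becomes next step's input
    return Model(inputs=[X, s0, c0, y0], outputs=outputs)
```

At inference you would pass zeros (or a start-token one-hot) as `y0`. Note that during training you would typically use teacher forcing instead, i.e. feed the ground-truth previous character rather than the model's own prediction, which would mean passing the shifted labels in as an extra input rather than looping `y_prev = out`.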

Thanks much,
