Hi @pongyuenlam
- For the code line:
"Pass the embedded input into the pre attention LSTM"
Hints:
- The LSTM you defined earlier should return the output alongside the state (made up of two tensors).
- Pass the state in to the LSTM (needed for inference).
The decoder layer's call signature clearly specifies `state=None`, but you have used `initial_state=state`, which is causing this error. It should be `initial_state=None`.
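To make the first hint concrete, here is a minimal sketch of how a Keras LSTM built with `return_state=True` returns its output alongside the two state tensors, and how a state can be fed back in at inference. The layer and variable names here are illustrative, not the assignment's exact code:

```python
import tensorflow as tf

# Illustrative names, not the assignment's code.
# return_state=True makes the LSTM return its output together with
# the hidden state and cell state tensors.
pre_attention_lstm = tf.keras.layers.LSTM(
    units=8, return_sequences=True, return_state=True
)

x = tf.random.normal((2, 5, 8))               # (batch, timesteps, features)
output, hidden, cell = pre_attention_lstm(x)  # state = [hidden, cell]

# During inference, the previous state can be passed back in:
output2, hidden2, cell2 = pre_attention_lstm(x, initial_state=[hidden, cell])
```

Note that `initial_state` is only supplied explicitly when you actually have a state to pass; otherwise it defaults to `None`.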
-
Another issue I can see is in the code line:
"Get the embedding of the input"
You have used the wrong input. Remember, it is the right-shifted translation that is used as the input.
-
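To illustrate what "right-shifted translation" means (tokens below are made up for the example): the decoder's input is the target sequence shifted right, so at each step the model sees the previous target token and is trained to predict the next one.

```python
# Illustrative teacher-forcing example; tokens are made up.
translation = ["[SOS]", "je", "suis", "content", "[EOS]"]

decoder_input = translation[:-1]  # right-shifted input: starts at [SOS]
labels = translation[1:]          # what the decoder should predict
```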
For the code line below:
"Perform cross attention between the context and the output of the LSTM (in that order)"
You are supposed to use the attention layer you defined earlier for cross attention, not the `CrossAttention` class directly.
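The pattern being pointed at is: create the attention layer once in `__init__` and reuse it in `call`, rather than constructing a new layer inside `call`. A minimal sketch, using `tf.keras.layers.MultiHeadAttention` as a stand-in for the assignment's `CrossAttention` (class and attribute names here are illustrative):

```python
import tensorflow as tf

# Illustrative decoder fragment, not the assignment's exact code.
class Decoder(tf.keras.layers.Layer):
    def __init__(self, units):
        super().__init__()
        # The attention layer is created ONCE here...
        self.attention = tf.keras.layers.MultiHeadAttention(
            num_heads=1, key_dim=units
        )

    def call(self, context, lstm_output):
        # ...and reused here via self.attention, instead of
        # instantiating a fresh attention layer on every call.
        return self.attention(query=lstm_output, value=context, key=context)

dec = Decoder(units=8)
context = tf.random.normal((2, 7, 8))      # encoder output
lstm_output = tf.random.normal((2, 5, 8))  # pre-attention LSTM output
out = dec(context, lstm_output)
```

Instantiating the layer inside `call` would create fresh, untrained weights on every forward pass, which is why the pre-defined layer must be used.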
Regards
DP