For future learners: the OP's mistake was in how the dec_padding_mask was defined.
Note that when creating the padding mask for the decoder's second attention block (the encoder-decoder attention), we build it from the encoder_input. In other words, we tell the decoder not to pay attention to the padding tokens of the document being summarized.
Also note that this is different from the look_ahead_mask (causal mask), which restricts each decoder position to attending only to itself and earlier positions. A sketch of how the masks fit together is below.
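Here is a minimal sketch of the mask construction, written in the style of the TensorFlow Transformer tutorial. It assumes padding token id 0, and the helper names (create_padding_mask, create_look_ahead_mask, create_masks) follow the tutorial's convention rather than anything specific to the OP's code:

```python
import tensorflow as tf

def create_padding_mask(seq):
    # 1.0 where seq is the padding token (assumed id 0), 0.0 elsewhere.
    seq = tf.cast(tf.math.equal(seq, 0), tf.float32)
    # Broadcastable to (batch, num_heads, seq_len_q, seq_len_k).
    return seq[:, tf.newaxis, tf.newaxis, :]

def create_look_ahead_mask(size):
    # Upper-triangular mask: position i may attend to positions 0..i only.
    return 1 - tf.linalg.band_part(tf.ones((size, size)), -1, 0)

def create_masks(inp, tar):
    # Encoder self-attention: mask padding in the encoder input.
    enc_padding_mask = create_padding_mask(inp)

    # Decoder's SECOND attention block attends over encoder output,
    # so its padding mask is also built from the encoder input (inp),
    # NOT from the decoder target -- this was the OP's mistake.
    dec_padding_mask = create_padding_mask(inp)

    # Decoder self-attention: combine the causal mask with the
    # padding mask of the decoder target.
    look_ahead_mask = create_look_ahead_mask(tf.shape(tar)[1])
    dec_target_padding_mask = create_padding_mask(tar)
    combined_mask = tf.maximum(dec_target_padding_mask, look_ahead_mask)

    return enc_padding_mask, combined_mask, dec_padding_mask
```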
Cheers