You probably made the same mistake.
Note that when creating the padding mask for the decoder’s second attention block, we use the encoder_input. In other words, we tell the decoder not to attend to the padding tokens of the document being summarized.
Also note that this is different from the look_ahead_mask (causal mask), which only allows each decoder position to attend to itself and the preceding target tokens.
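To make the distinction concrete, here is a minimal NumPy sketch. It assumes the common convention that a 1 in the mask marks a position to be ignored and that the padding id is 0; the names encoder_input, decoder_input, create_padding_mask, and create_look_ahead_mask are illustrative, not necessarily the ones in your code:

```python
import numpy as np

def create_padding_mask(seq, pad_id=0):
    # 1 where the token is padding, 0 elsewhere; shape (batch, 1, 1, seq_len)
    # so it broadcasts over attention logits of shape (batch, heads, q_len, k_len).
    mask = (seq == pad_id).astype(np.float32)
    return mask[:, np.newaxis, np.newaxis, :]

def create_look_ahead_mask(size):
    # 1s above the diagonal: position i may only attend to positions <= i,
    # i.e. itself and the tokens before it.
    return np.triu(np.ones((size, size), dtype=np.float32), k=1)

# Toy batch: encoder_input is the padded source document,
# decoder_input is the padded target summary.
encoder_input = np.array([[5, 8, 3, 0, 0]])   # source ids, 0 = <pad>
decoder_input = np.array([[7, 2, 0]])         # target ids, 0 = <pad>

# Mask for the decoder's second (cross-)attention block:
# built from encoder_input, because the keys/values come from the encoder output.
cross_attention_padding_mask = create_padding_mask(encoder_input)

# Mask for the decoder's first (self-)attention block:
# the causal mask combined with the decoder's own padding mask.
look_ahead_mask = create_look_ahead_mask(decoder_input.shape[1])
decoder_self_attention_mask = np.maximum(
    create_padding_mask(decoder_input), look_ahead_mask)
```

The key point is that the two masks are built from different inputs: the cross-attention padding mask comes from the source sequence, while the look-ahead mask depends only on the target length.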
Cheers