Hi,
For the encoder, I passed the arguments input_sentence (the input to the encoder), look_ahead_mask (the mask for the target input), and enc_padding_mask; for the decoder, I passed encoder_output (the encoder’s output, which serves as the input to the multi-head attention) and dec_padding_mask (which serves as the boolean mask for the second MHA layer). I’m getting the following errors. It would be really helpful if you could pinpoint the mistake.
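To make that concrete, here is a hedged reconstruction of the calls I’m describing inside Transformer.call() (names such as self.encoder, self.decoder, output_sentence, and self.final_layer are my approximation of the assignment’s structure, not the exact notebook code):

```python
# Hedged reconstruction of the calls described above,
# not the exact notebook code.
def call(self, input_sentence, output_sentence, training,
         enc_padding_mask, look_ahead_mask, dec_padding_mask):
    # Encoder, called with: input sentence, look_ahead_mask,
    # enc_padding_mask.
    enc_output = self.encoder(input_sentence, look_ahead_mask,
                              enc_padding_mask)

    # Decoder: the encoder output feeds the second MHA block,
    # with dec_padding_mask as its boolean mask.
    dec_output, attention_weights = self.decoder(
        output_sentence, enc_output, training,
        look_ahead_mask, dec_padding_mask)

    # Assumed final projection layer.
    return self.final_layer(dec_output), attention_weights
```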
I believe the “training” variable should be a boolean, not a tensor of shape (1, 7, 7).
Thank you so much for the timely response. I traced back to where this self.dropout is declared, which is in the Encoder class. Within that class, the documentation states: “training (bool): Boolean, set to true to activate the training mode for dropout layers”.
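For context, here is a minimal runnable sketch of that behavior (standard tf.keras, not assignment code), showing why the flag must be a boolean rather than a mask tensor:

```python
import tensorflow as tf

dropout = tf.keras.layers.Dropout(rate=0.1)
x = tf.ones((1, 7, 4))

# A boolean switches dropout on (zeroes and rescales some
# activations) or off (identity):
train_out = dropout(x, training=True)
eval_out = dropout(x, training=False)

# The mix-up diagnosed above: a (1, 7, 7) mask tensor in the
# training slot makes the layer try to branch on a non-scalar
# tensor, which raises an error.
look_ahead_mask = tf.ones((1, 7, 7))
# dropout(x, training=look_ahead_mask)  # raises
```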
It appears the training value was modified somehow by the time it reached your code in transformer.call().
Hi TMosh. Thanks for helping out. I went back to the Encoder and Decoder classes to check which arguments each function call within them accepts. It turns out I was wrong: the Encoder needs the input sentence, the training flag, and enc_padding_mask, as sketched below. Thank you for helping me debug it!
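For anyone who lands here with the same error, a hedged sketch of the corrected calls (argument names again approximate the assignment, not the exact notebook):

```python
# The Encoder takes the input sentence, the boolean training
# flag, and enc_padding_mask, in that order:
enc_output = self.encoder(input_sentence, training, enc_padding_mask)

# look_ahead_mask belongs with the decoder's first (masked) MHA
# block, not with the encoder:
dec_output, attention_weights = self.decoder(
    output_sentence, enc_output, training,
    look_ahead_mask, dec_padding_mask)
```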
Nice work!
Fingers crossed the 0/60 grading problem doesn’t show up.
I’m not a mentor for this course, so I don’t recognize what the “0/60 grading problem” refers to.
Maybe a course mentor will reply here.