Week4 (Transformer Network), Exercise 4

srikanthm · July 6, 2021, 9:55pm

Hello,

I am getting below error when trying to do the step ‘apply dropout layer to the self-attention output’. I am using self.dropout1 and passing along the self_attn_output from the previous step and the training parameter. Don’t understand what I am doing wrong Please help.

“Attempt to convert a value (<tensorflow.python.keras.layers.multi_head_attention.MultiHeadAttention object at 0x7feb0816e850>) with an unsupported type (<class ‘tensorflow.python.keras.layers.multi_head_attention.MultiHeadAttention’>) to a Tensor.”

TMosh · July 6, 2021, 11:48pm

Are you using “training = training”?
Perhaps the problem could be with your code that computes the self_attn_output.

srikanthm · July 6, 2021, 11:50pm

Hello Tom ,

I just resolved this, yes, the problem was with the step before it using the ‘mha’. Thank you.

Topic		Replies	Views
Week 4 Assignment 1 Transformers Architecture with TensorFlow Exercise 8 Transformer Sequence Models coursera-platform	7	1062	July 13, 2021
C5W4A1 exercise 4,5 Sequence Models coursera-platform	7	607	December 17, 2022
C5 W4 A1: Encode layer dropout error Sequence Models coursera-platform	2	671	January 30, 2022
C5_W4_A1_Transformer_Subclass_v1 UNQ4 Sequence Models coursera-platform	19	1019	March 10, 2022
C5 W4 UNQ_C4 Wrong values when training=True Sequence Models coursera-platform	14	1719	June 6, 2023

Week4 (Transformer Network), Exercise 4

Related topics