Course 5 Week4 Exercise 4

yiwu.yt · June 2, 2021, 11:30pm

I got the following error with this code:
# START CODE HERE
# calculate self-attention using mha(~1 line)
attn_output = self.mha(x) # Self attention (batch_size, input_seq_len, embedding_dim)

TypeError: call() missing 1 required positional argument: ‘value’

What is the positional argument?

arosacastillo · June 4, 2021, 3:10pm

Hi yiwu,

Just go to the documentation of the MultiHeadAttention layer and pay attention to the example and how many arguments are passed

Good luck

Rosa

arosacastillo · June 4, 2021, 3:23pm

Besides the call function has a ‘mask’ parameter that you need to use somewhere… well the place to use it is exactly in that layer. Check the parameters of the MultiHeadAttention to find where to use it.

yiwu.yt · June 4, 2021, 4:10pm

Thank you for your reply.
It’s very helpful!!

Oleksandr_Semenov · December 23, 2021, 2:09am

I am also having a hard time with this exercise. Passing mask as an argument (attn_output = self.mha(x, mask)) gives me the following error
“InvalidArgumentError: cannot compute Einsum as input #1(zero-based) was expected to be a int64 tensor but is a float tensor [Op:Einsum]”

Reading the documentation I see that we need to pass two parameters to MultiHeadAttention. Number of heads I understand should be 3. One for each Q, V, and K. Then key dimension is the dimension of the K? Just passing integers into this function also gives an error. I am very confused now on how to use this function.

TMosh · December 23, 2021, 2:31am

self.mha(…) requires four parameters.
The Q, V, and K parameters, and the mask.
All three of Q, V, and K are the ‘x’ variable, since this is self-attention.

Oleksandr_Semenov · December 24, 2021, 4:15pm

Thank you @TMosh. This helped!

Topic		Replies	Views
DLS 5 Week 4 Ex 4 Sequence Models	1	537	December 12, 2021
Programming Assignment: Transformers Architecture with TensorFlow encoderlayer Sequence Models week-4	2	395	January 23, 2024
C5 W4 A1: Question about MultiHeadAttention Sequence Models	2	738	August 4, 2021
C5 W4: Exercise 4 EncoderLayer() At least need to know Sequence Models	13	638	December 3, 2022
C5W4A1 - UNQC4 - EncoderLayer - ValueError: The first argument to `Layer.call` must always be passed Sequence Models	4	572	April 3, 2023

Course 5 Week4 Exercise 4

Related topics