The first instruction says:
- You will pass the Q, V, K matrices and a boolean mask to a multi-head attention layer. Remember that to compute self-attention, Q, V and K should be the same.
But how do I get the Q, V, K matrices? They are not included in the def call(self, x, …) parameters.
I keep calling self_attn_output = self.mha(…) but I keep getting error messages.
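To check my own understanding, here is a rough pure-Python sketch of what I think a single attention head computes when Q, V and K are all the same matrix x (no mask, no learned projections; just an illustration of "Q, V and K should be the same", not what self.mha does internally):

```python
import math

def softmax(scores):
    # Numerically stable softmax over a list of scores
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(x):
    """Scaled dot-product self-attention on x, a [seq_len][d] list of lists.
    The key point: Q, V and K are all the same matrix x."""
    q = k = v = x  # all three inputs are x -- this is what makes it "self"-attention
    d = len(x[0])
    out = []
    for qi in q:
        # score each key row against this query row, scaled by sqrt(d)
        scores = [sum(a * b for a, b in zip(qi, kj)) / math.sqrt(d) for kj in k]
        weights = softmax(scores)
        # output row is the attention-weighted sum of the value rows
        out.append([sum(w * vj[t] for w, vj in zip(weights, v)) for t in range(d)])
    return out
```

Is this the right mental model, and if so, is the fix just to pass x three times to the layer?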
Any help?