Decode Layer - Exercise 2 - General question

ankit222 · February 27, 2025, 5:10pm

I have a general question about the class definition. For example, in the DecodeLayer class, in init function, self.mha1 is defined. In the call function in the class, when we use the self.mha1 function, how do we know which arguments the self.mha1 is supposed to take? I am confused about the definition of function/method in init and its use in call function.

Thanks.

gent.spah · February 28, 2025, 7:24am

Hello, in Tesnorflow multi-head attention link provided in the lab, it expalins the arguments that are used to call that function, here have a look:

Note here Q, K, V are the same ie. x.

ankit222 · February 28, 2025, 7:56am

I had seen the documentation previously. However, to clarify my original question, why do we define the self.mha1 and self.mha2 in the init function? It seems redundant to have the mha in both the init and call function?

gent.spah · February 28, 2025, 8:01am

In the innit definition they are initialising the mha, and then in the call is being called!

Topic		Replies	Views
C5_w5_a1 unq_c4 Sequence Models	2	543	February 1, 2023
NLP - C4W1 - A couple questions related to the assignment NLP with Attention Models week-4	4	63	August 23, 2024
Programming Assignment: Transformers Architecture with TensorFlow encoderlayer Sequence Models week-4	2	395	January 23, 2024
C5 W4: Exercise 4 EncoderLayer() At least need to know Sequence Models	13	638	December 3, 2022
C5 W4 A1 EncoderLayer arguments for self.mha Sequence Models	4	586	May 18, 2023

Decode Layer - Exercise 2 - General question

Related topics