C5_W4_A1_Transformer_Subclass_v1 Exercise 4

faraday · September 23, 2023, 6:34pm

For Exercise 4, i am meant to pass Q,K,V matrices to multi attention layer, issue is i have no idea where these matrices are, they were not defined as instance or class variables and were not passed in the function call, the parameters in the function call are self, x, training, and mask, kindly help me understand how to go about this, thanks

elirod · September 23, 2023, 6:46pm

Hi @faraday

Welcome to the community.

What course are you refereeing to?

It turns out you posted on the general category.

best regards

faraday · September 23, 2023, 6:48pm

ohh thats true, i’ll change it i am referring to deep learning specialization sequence models

TMosh · September 26, 2023, 1:29am

For self-attention, the x matrix is used for each of K, Q, and V.

Topic		Replies	Views
DLS Course 5 Week 4 Exercise 4 Sequence Models coursera-platform	2	714	June 29, 2021
Course 5 - Week 4 - A1 - Exercise 4 - EncoderLayer Sequence Models week-4 , coursera-platform	2	40	August 13, 2024
Week 4 Encoder Layer Sequence Models coursera-platform	2	730	August 9, 2021
Question on Transformers Sequence Models coursera-platform	3	531	July 16, 2023
C5W4 Transformer multi-head weight matrices Sequence Models coursera-platform	4	824	June 30, 2022

C5_W4_A1_Transformer_Subclass_v1 Exercise 4

Related topics