For Exercise 4, i am meant to pass Q,K,V matrices to multi attention layer, issue is i have no idea where these matrices are, they were not defined as instance or class variables and were not passed in the function call, the parameters in the function call are self, x, training, and mask, kindly help me understand how to go about this, thanks
Welcome to the community.
What course are you refereeing to?
It turns out you posted on the general category.
ohh thats true, i’ll change it i am referring to deep learning specialization sequence models
For self-attention, the x matrix is used for each of K, Q, and V.