Course 5 - Week 4 - A1 - Exercise 4 - EncoderLayer

leo76 · August 11, 2024, 10:25pm

Hello,
I would be grateful if you could please explain why the Q, V, and K matrices are all the same ‘x’ matrix. Thanks.

TMosh · August 12, 2024, 3:05am

That’s how self-attention is defined.

leo76 · August 13, 2024, 10:29pm

Thanks. I understand self-attention; I was just trying to get my head around the syntax. I guess the weight matrices are not exposed because they are trainable parameters. Also, having completed the assignment, I now understand that the inputs for Q, V, and K can differ, e.g. in the decoder.
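For anyone else puzzled by the syntax, here is a minimal pure-Python sketch of the idea, not the assignment's code. `ToyAttention` and its attributes `w_q`, `w_k`, `w_v` are hypothetical names made up for illustration. The argument order (query, value, key) mirrors the call order of Keras `MultiHeadAttention`. The point is that the same `x` goes in three times, but the layer multiplies it by three different internal weight matrices, so the actual Q, K, and V end up different.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one row of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


class ToyAttention:
    """Single-head attention that owns its projection weights (hypothetical class)."""

    def __init__(self, d_model, seed=0):
        rng = random.Random(seed)

        def init():
            return [[rng.uniform(-0.5, 0.5) for _ in range(d_model)]
                    for _ in range(d_model)]

        # Stand-ins for trainable weights: they live inside the layer,
        # so the caller never passes them in.
        self.w_q, self.w_k, self.w_v = init(), init(), init()
        self.d_model = d_model

    def __call__(self, query, value, key):
        # The raw inputs are projected here, inside the layer.
        q = matmul(query, self.w_q)
        k = matmul(key, self.w_k)
        v = matmul(value, self.w_v)
        k_t = [list(col) for col in zip(*k)]
        scale = math.sqrt(self.d_model)
        weights = [softmax([s / scale for s in row]) for row in matmul(q, k_t)]
        # q and k are returned only so the example can inspect them.
        return matmul(weights, v), q, k


x = [[1.0, 0.0, 2.0], [0.5, 1.0, 0.0]]  # 2 tokens, d_model = 3
attn = ToyAttention(d_model=3)

# Self-attention (encoder style): the same x is passed three times.
self_out, q, k = attn(x, x, x)

# Cross-attention (decoder style): query from one sequence,
# value and key from another, here a made-up 3-token encoder output.
enc_out = [[0.2, 0.1, 0.0], [1.0, 1.0, 1.0], [0.0, 0.5, 0.3]]
cross_out, _, _ = attn(x, enc_out, enc_out)
```

In both calls the output has one row per query token. Only the source of the keys and values changes, which is the difference between the encoder's self-attention and the decoder's second attention block.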
