Hello community,
I pass the unit test reading several topics on forum but I don’t understand what I am doing and why is it working…
Can you explain me why the argument of self.mha() is x,x,x,mask ?
self_mha_output = self.mha(x,x,x,mask)
When I look into the function init I see
self.mha = MultiHeadAttention(num_heads=num_heads, key_dim=embedding_dim, dropout=dropout_rate)
I don’t understand where come from the
self.mha(query=..., value=..., key=... attention_mask=...)
Best regards