Scaled_dot_product_attention(q, k, v, mask) function:

I get this error: AssertionError: Wrong masked weights

Can you please elaborate on your problem?

Maybe you use the tensorflow tutorial:

Never mind, I figured it out

1 Like