Hello,
I’m a bit lost trying to use the MultiHeadedAttention.
My code currently looks like this:
# START CODE HERE
# mentor edit - code removed
# END CODE HERE
But I’m getting the error:
AssertionError: Wrong values when training=True
I tried reading the hints, but I don’t know where to get the values Q, K, and V to pass to the mha. I thought the mha was supposed to calculate these internally.