The hint is a bit misleading, even I was stuck because of the same, arvyzukai NLP mentor guided me through
So based on the instruction previous shared, axis should be 1
For the mask exclude positives, your code is correct.
Next thing, check if the score codes is recalled as (v2, v1, transpose_b=true), note v2 is placed before v1