I think there might be an issue with w2_tests.test_compute_attention_output_closure or I did some mistake I can’t find.
Both test cases are using same x variable but n_heads and d_head variables are different, consequentially I’m getting InconclusiveDimensionOperation: Cannot divide evenly the sizes of shapes (6, 2, 3) and (2, 3, 2, 2) since d_head is 2 but actual last dimension of x is 3.
I’ve tried to use x.shape[2] instead of d_head but in this case the test case fails with wrong result error.
Can you share your lab ID with me ? In the assignment, when you click the top right “Help” button, a panel will open and your lab ID will be shown at the bottom.
I shall take a look.
When you reply back, kindly tag me in the post so that I’m notified.