C5_W4A1 scaled_dot_product_attention wrong masked values

AssertionError Traceback (most recent call last)
in
1 # UNIT TEST
----> 2 scaled_dot_product_attention_test(scaled_dot_product_attention)

~/work/W4A1/public_tests.py in scaled_dot_product_attention_test(target)
73 assert np.allclose(weights, [[0.30719590187072754, 0.5064803957939148, 0.0, 0.18632373213768005],
74 [0.3836517333984375, 0.3836517333984375, 0.0, 0.2326965481042862],
---> 75 [0.3836517333984375, 0.3836517333984375, 0.0, 0.2326965481042862]]), "Wrong masked weights"
76 assert np.allclose(attention, [[0.6928040981292725, 0.18632373213768005],
77 [0.6163482666015625, 0.2326965481042862],

AssertionError: Wrong masked weights

Any idea where I'm going wrong here? It took all day, but I've failed to find the mistake.

hi @Sajjad_Ali

Check the linked comment below.

regards
DP


@Deepti_Prasad is right. Add the mask to the scaled tensor before applying the softmax. The masking operation should look like this:

if mask is not None:
    scaled_tensor += (1.0 - mask) * -1e9
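To see how that line fits into the whole function, here is a minimal NumPy sketch of scaled dot-product attention. It follows the assignment's mask convention (1 = attend, 0 = mask out); the `softmax` helper and variable names are my own, not the assignment's exact code:

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis
    e = np.exp(x - np.max(x, axis=axis, keepdims=True))
    return e / np.sum(e, axis=axis, keepdims=True)

def scaled_dot_product_attention(q, k, v, mask=None):
    # q: (seq_q, d_k), k: (seq_k, d_k), v: (seq_k, d_v)
    d_k = k.shape[-1]
    scaled = q @ k.T / np.sqrt(d_k)  # raw attention scores
    if mask is not None:
        # Masked positions get a large negative score, so the
        # softmax drives their weights to ~0
        scaled += (1.0 - mask) * -1e9
    weights = softmax(scaled, axis=-1)
    return weights @ v, weights
```

With a mask that zeros out one key position, the corresponding column of `weights` comes out as (approximately) zero after the softmax, which is exactly what the unit test checks for.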

Ah, thanks, that was the issue. I just overlooked the instruction, and it wasn't mentioned in the lecture. Thank you for highlighting it.
