I knew that, but I still can't figure it out, which is why I wrote this post. Could you please help me pinpoint what's wrong?
I used the following:
(mask * -1e9)
tf.keras.activations.softmax(scaled_attention_logits, axis=-1)
Appreciate your assistance
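For context, the sign of the mask term depends on which convention the mask tensor follows. Here is a minimal sketch of the difference, assuming a padding mask where 1 marks positions to keep and 0 marks padding (this convention, and the toy values, are assumptions for illustration, not taken from the post above):

```python
import tensorflow as tf

# Toy logits for one query over three key positions; the last position is padding.
scaled_attention_logits = tf.constant([[2.0, 1.0, 0.5]])
mask = tf.constant([[1.0, 1.0, 0.0]])  # assumed convention: 1 = keep, 0 = padding

# With a "1 = keep" mask, the padded position must receive the -1e9 bias,
# so the added term is (1 - mask) * -1e9 rather than mask * -1e9.
logits_masked = scaled_attention_logits + (1.0 - mask) * -1e9
weights = tf.keras.activations.softmax(logits_masked, axis=-1)
# weights[..., -1] is ~0, i.e. the padded position gets essentially no attention.
```

If the mask instead uses 1 to mark positions to ignore (as some tutorials do), the term `mask * -1e9` is the right one; mixing the two conventions inverts the masking.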
Thanks, that's helpful and solves the issue.
May I ask why we changed the value from the one recommended in the library?
Is it based on trial and error, or on some analysis I should know about?
I didn't get this hint. I don't think the 1 needs to be a float, because the term is multiplied by the float -1e9 anyway. I'm not criticizing, just giving feedback.
Where are we supposed to use (1 - mask) * -1e9 in the code? I tried adding it to the scaled_attention_logits, but I'm getting the 'wrong masked weights' error. Could you please help me with this issue?
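For what it's worth, here is a minimal sketch of where that term usually goes inside scaled dot-product attention. It assumes a mask broadcastable to the logits with 1 marking positions to keep; the function name and that keep/ignore convention are assumptions for illustration, not taken from the tutorial:

```python
import tensorflow as tf

def scaled_dot_product_attention(q, k, v, mask=None):
    # q: (..., seq_len_q, depth), k: (..., seq_len_k, depth), v: (..., seq_len_k, depth_v)
    matmul_qk = tf.matmul(q, k, transpose_b=True)            # (..., seq_len_q, seq_len_k)
    dk = tf.cast(tf.shape(k)[-1], tf.float32)
    scaled_attention_logits = matmul_qk / tf.math.sqrt(dk)

    if mask is not None:
        # Assumed convention: mask == 1 keeps a position, mask == 0 masks it out.
        # Masked positions get a large negative bias so softmax drives their weights to ~0.
        scaled_attention_logits += (1.0 - mask) * -1e9

    attention_weights = tf.keras.activations.softmax(scaled_attention_logits, axis=-1)
    return tf.matmul(attention_weights, v), attention_weights
```

The key point is that the mask term is added to the scaled logits before the softmax; if the weights come out inverted, the mask convention (1 = keep vs. 1 = ignore) is the first thing to check.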