C5 W4 A1 E3: AssertionError: Wrong unmasked weights

I am getting this error in the scaled_dot_product_attention function.

attention weights - (3, 4)
V shape - (4, 2)
<class 'tensorflow.python.framework.ops.EagerTensor'>

AssertionError Traceback (most recent call last)
in
1 # UNIT TEST
----> 2 scaled_dot_product_attention_test(scaled_dot_product_attention)

~/work/W4A1/public_tests.py in scaled_dot_product_attention_test(target)
60 assert np.allclose(weights, [[0.2589478, 0.42693272, 0.15705977, 0.15705977],
61 [0.2772748, 0.2772748, 0.2772748, 0.16817567],
---> 62 [0.33620113, 0.33620113, 0.12368149, 0.2039163 ]]), "Wrong unmasked weights"
63
64 assert tf.is_tensor(attention), "Output must be a tensor"

AssertionError: Wrong unmasked weights

I could not figure out the cause of this in the code. Any help is appreciated.

If none of these search results help, please click my name and message your notebook as an attachment.

I have sent you the notebook.

Please fix the following:

  1. When calculating matmul_qk, k is not multiplied directly; it's transformed before the multiplication. See the equation in the markdown for details.
  2. It's safer to use negative indexing to find dk, since the earlier dimensions are not fixed.
  3. For the if mask is not None case, read this hint: Multiply (1. - mask) by -1e9 before applying the softmax (a short sketch of this step follows the list).
  4. The calculation of the output is incorrect. Go back to the equation and notice that there are only 2 terms, not 3.
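
To make point 3 concrete, here is a minimal sketch of just that masking step, with dummy shapes and the variable names scaled_attention_logits and mask assumed from the notebook (this is an illustration, not the graded code):

```python
import tensorflow as tf

# Dummy logits and mask purely to illustrate the hint (shapes are assumptions)
scaled_attention_logits = tf.zeros((3, 4))
mask = tf.constant([[1., 1., 1., 0.]])  # 1 = attend to this position, 0 = ignore it

# (1. - mask) is 1 only at the ignored positions, so roughly -1e9 is added
# to their logits and softmax drives their weight to nearly zero
if mask is not None:
    scaled_attention_logits += (1. - mask) * -1e9

print(tf.nn.softmax(scaled_attention_logits, axis=-1))
# last column is ~0, the other entries are ~0.333 each
```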

Balaji,

Thanks for the response.

Regarding this:

  1. For if mask is not None case, read this hint: Multiply (1. - mask) by -1e9 before applying the softmax.
    Where is this -1e9 factor coming from? What do we need to do?
    Does it read "(1 dot -mask) * -1e9"?

Thanks
Bimal

See this:

I do not understand how k should be transformed before multiplication. Where is this explained?

What I meant was the transpose operation being applied to the key before multiplication.
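
For example, tf.matmul can apply that transpose for you via its transpose_b argument (a minimal sketch with placeholder shapes, not the notebook code):

```python
import tensorflow as tf

q = tf.random.uniform((3, 4))  # (seq_len_q, depth) -- placeholder values
k = tf.random.uniform((4, 4))  # (seq_len_k, depth)

# Multiplies q by the transpose of k over its last two axes,
# giving shape (seq_len_q, seq_len_k) = (3, 4)
matmul_qk = tf.matmul(q, k, transpose_b=True)
print(matmul_qk.shape)
```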

Here are the steps:

  1. matmul_qk is created by multiplying the query and the key, keeping shapes in mind.
  2. dk should be initialized to the correct value based on the shape of the key.
  3. scaled_attention_logits is calculated by dividing the two quantities from steps 1 and 2.
  4. If mask is not None, add the mask term ((1. - mask) * -1e9) to the logits computed so far.
  5. The attention weights are computed by applying softmax to those logits (see the sketch below).
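
Putting those steps together, here is a minimal sketch in TensorFlow; it is an illustration under the assumptions above (including the (1. - mask) * -1e9 convention), not the graded notebook code:

```python
import tensorflow as tf

def scaled_dot_product_attention_sketch(q, k, v, mask=None):
    # Step 1: query times the transposed key -> (..., seq_len_q, seq_len_k)
    matmul_qk = tf.matmul(q, k, transpose_b=True)

    # Step 2: dk from the last axis of the key; negative indexing keeps this
    # correct no matter how many leading (batch) dimensions there are
    dk = tf.cast(tf.shape(k)[-1], tf.float32)

    # Step 3: scale the logits by sqrt(dk)
    scaled_attention_logits = matmul_qk / tf.math.sqrt(dk)

    # Step 4: add the mask term so ignored positions end up with ~zero weight
    if mask is not None:
        scaled_attention_logits += (1. - mask) * -1e9

    # Step 5: softmax over the last (key) axis gives the attention weights
    attention_weights = tf.nn.softmax(scaled_attention_logits, axis=-1)

    # Output: only two terms -- the attention weights and the values
    output = tf.matmul(attention_weights, v)
    return output, attention_weights
```

With the shapes from the original post, attention weights of (3, 4) multiplied by a V of (4, 2) give an output of shape (3, 2).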