C5W4 Programming Ass UNQ_C3

walrus · August 6, 2021, 1:16am

I’m stuck. Can’t find the problem. I’m applying the mask and -1e9 when computing the scaled_attention_logits. I don’t understand why it is += instead of multiplication. Keep getting the following error.

I’m using tf.nn.softmax to compute attention_weights on the scaled_attention_logits, axis = 1.

AssertionError Traceback (most recent call last)
in
1 # UNIT TEST
----> 2 scaled_dot_product_attention_test(scaled_dot_product_attention)

~/work/W4A1/public_tests.py in scaled_dot_product_attention_test(target)
73 assert np.allclose(weights, [[0.30719590187072754, 0.5064803957939148, 0.0, 0.18632373213768005],
74 [0.3836517333984375, 0.3836517333984375, 0.0, 0.2326965481042862],
—> 75 [0.3836517333984375, 0.3836517333984375, 0.0, 0.2326965481042862]]), “Wrong masked weights”
76 assert np.allclose(attention, [[0.6928040981292725, 0.18632373213768005],
77 [0.6163482666015625, 0.2326965481042862],

AssertionError: Wrong masked weights

TMosh · August 6, 2021, 1:52am

Did you read this part of the instructions carefully?

walrus · August 6, 2021, 2:41am

I read, multiple them, so I have the following

(1.-mask) * -1e9

TMosh · August 6, 2021, 2:51am

Try using tf.keras.activations.softmax(…) instead. You don’t need an “axis” argument.

walrus · August 6, 2021, 3:50am

wow! so I don’t need an axis, that solved it, thank you!

walrus · August 6, 2021, 3:52am

Why is axis a problem here?

TMosh · August 6, 2021, 6:27am

If you’re going to specify an axis, you need to be sure it’s the correct one.
In this case, the default axis is the correct one.

wayne9598 · August 6, 2021, 3:08pm

what does this:

Multiply (1. - mask) by -1e9 before applying the softmax.

even mean??

Topic		Replies	Views
C5_W4A1 scaled_dot_product_attention wrong masked values Sequence Models week-4	3	44	September 19, 2024
Course 5 week 4 dot product wrong masked weights Sequence Models	13	1192	October 2, 2023
Scales dot product attention Sequence Models	2	952	June 18, 2021
C5_W4_A1 scaled_dot_product_attention mask issue Sequence Models	3	1011	November 15, 2022
C5 W4 A1: Wrong masked weights: scaled_dot_product_attention() Sequence Models	4	722	February 6, 2022

C5W4 Programming Ass UNQ_C3

Related topics