Hi, I have tried all of the solutions suggested in the two threads link1 and link2, but none of them worked.
To be specific, I did the following (a sketch of how these steps fit together follows the list):
applied -1e9 to the mask before adding it to the scaled tensor
used tf.keras.activations.softmax(scaled_attention_logits, axis=-1)
used tf.cast(tf.shape(k)[-1], tf.float32) to get dk
used tf.matmul(q, k, transpose_b=True)
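For reference, here is a minimal sketch of how I understand those four steps should fit together (this is not my actual code; the function name and the mask convention of 1 = keep / 0 = pad are assumptions taken from the reminder below):

```python
import tensorflow as tf

def scaled_dot_product_attention(q, k, v, mask):
    # QK^T, with k transposed on its last two axes
    matmul_qk = tf.matmul(q, k, transpose_b=True)

    # Scale by sqrt(dk), where dk is the depth of the keys
    dk = tf.cast(tf.shape(k)[-1], tf.float32)
    scaled_attention_logits = matmul_qk / tf.math.sqrt(dk)

    # Add the mask term BEFORE the softmax; per the reminder, the added
    # quantity is (1. - mask) * -1e9, so positions where mask == 0 are
    # pushed toward -inf and receive ~0 attention weight
    if mask is not None:
        scaled_attention_logits += (1. - mask) * -1e9

    # Softmax over the last axis so each row of weights sums to 1
    attention_weights = tf.keras.activations.softmax(scaled_attention_logits, axis=-1)

    output = tf.matmul(attention_weights, v)
    return output, attention_weights
```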
but I am still getting the error: AssertionError: Wrong masked weights
I also refreshed my notebook, but the same issue happened again.
Since I can't post my full code, I am posting only parts of it. (I can send my code in a message if you want to have a look at it.)
Reminder: … Multiply (1. - mask) by -1e9 before applying the softmax.
Adding the wrong quantity to scaled_attention_logits might be the problem: per the reminder, the term added should be (1. - mask) * -1e9, not mask * -1e9.
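To illustrate the difference with toy numbers (my own example, assuming mask == 1 marks positions to keep):

```python
import tensorflow as tf

mask = tf.constant([[1., 1., 0.]])     # 1 = attend, 0 = padding
logits = tf.constant([[0.5, 0.2, 0.9]])

wrong = logits + mask * -1e9           # kills the positions you want to KEEP
right = logits + (1. - mask) * -1e9    # kills only the padded position

print(tf.keras.activations.softmax(wrong))  # ~[0., 0., 1.]: all weight lands on padding
print(tf.keras.activations.softmax(right))  # ~[0.57, 0.43, 0.]: padding gets ~0 weight
```

With the wrong quantity, the masked positions end up with nonzero weight, which is exactly the kind of thing a "Wrong masked weights" assertion would catch.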