Wrong masked weights error

ahpatil11 · September 22, 2021, 12:40pm

Hi,
I am working on scaled_dot_product_attention() and I am getting a "Wrong masked weights" error.

Below is the code for the function.
Please let me know how to proceed.
matmul_qk = tf.matmul(q, k, transpose_b = True) # (…, seq_len_q, seq_len_k)

# scale matmul_qk
dk = tf.cast(tf.shape(k)[-1], tf.float32)
scaled_attention_logits = matmul_qk / tf.math.sqrt(dk)

# add the mask to the scaled tensor.
if mask is not None: # Don't replace this None
    scaled_attention_logits += (mask * -1e9) 

# softmax is normalized on the last axis (seq_len_k) so that the scores
# add up to 1.
attention_weights = tf.nn.softmax(scaled_attention_logits, axis = -1)  # (..., seq_len_q, seq_len_k)

output = tf.matmul(attention_weights, v)  # (..., seq_len_q, depth_v)

Kic · September 22, 2021, 5:57pm

Hi @ahpatil11

dk is the number of keys used in the matrix ‘k’, so it is obtained by np.shape(k)[0]
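
For comparison, here is a quick way to check which axis is which, sketched in plain Python with a made-up 3x4 key matrix (the values and the helper name `key_depth` are hypothetical, not from the assignment). In the usual scaled dot-product attention formula, dk is the dimensionality of each key vector, which is the last axis of k. The first axis of a 2-D k is the number of keys (seq_len_k), so the two only coincide when k happens to be square.

```python
import math

# Hypothetical key matrix: 3 keys (seq_len_k), each of depth 4.
k = [
    [1.0, 0.0, 1.0, 0.0],
    [0.0, 1.0, 0.0, 1.0],
    [1.0, 1.0, 0.0, 0.0],
]


def key_depth(keys):
    """Return the size of the last axis of a 2-D list of key vectors."""
    return len(keys[0])


seq_len_k = len(k)      # first axis: number of keys
dk = key_depth(k)       # last axis: depth of each key vector
scale = math.sqrt(dk)   # divisor applied to the q.k logits

print(seq_len_k, dk, scale)
```

With a tensor, the same quantity would be read from the last entry of the shape, as the `tf.shape(k)[-1]` line in the question does.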

TMosh · September 22, 2021, 9:45pm

I think you also need to subtract the mask from 1.
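
To see why the direction of the mask matters, here is a minimal pure-Python sketch for a single query row. It assumes the mask uses 1 for positions to attend to and 0 for padding, which the thread does not state outright but the suggestion above implies. The names `softmax` and `masked_weights` and the numbers are made up for illustration.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def masked_weights(logits, mask, invert):
    """Add a large negative number to some logits, then apply softmax.

    invert=True  adds (1 - mask) * -1e9, hiding positions where mask == 0.
    invert=False adds mask * -1e9, hiding positions where mask == 1.
    """
    penalty = [((1.0 - m) if invert else m) * -1e9 for m in mask]
    return softmax([x + p for x, p in zip(logits, penalty)])


logits = [2.0, 1.0, 0.5]   # hypothetical scaled logits for one query
mask = [1.0, 1.0, 0.0]     # assumed convention: the last key is padding

kept = masked_weights(logits, mask, invert=True)      # padding gets weight 0
flipped = masked_weights(logits, mask, invert=False)  # real keys get weight 0

print(kept)
print(flipped)
```

Under that assumption, `mask * -1e9` pushes all the weight onto the padded position, while `(1 - mask) * -1e9` zeroes it out. If the mask instead marks padding with 1, the un-inverted form is the right one, so it is worth checking the assignment's mask helper first.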
