Padding mask shape problem

liangyuantong · June 25, 2021, 8:28am

In 2.1 - Padding Mask, the assignment shows the difference between computing softmax directly and computing softmax after adding a large negative value (approximating negative infinity) at the masked positions.

print(tf.keras.activations.softmax(x))
print(tf.keras.activations.softmax(x + create_padding_mask(x) * -1.0e9))

The second line of code results in a shape problem: x and the padding mask do not have matching shapes, so they cannot be added directly. It should be changed to the following:
print(tf.keras.activations.softmax(x[:, tf.newaxis, tf.newaxis, :] + create_padding_mask(x) * -1.0e9))
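To make the shape issue concrete, here is a minimal pure-Python sketch of NumPy-style broadcasting, which TensorFlow also follows for addition. The shapes are assumptions for illustration, not taken from the notebook: x is taken to be (batch, seq_len) = (3, 5) and the mask returned by create_padding_mask is taken to be (batch, 1, 1, seq_len). The helper broadcast_shape is hypothetical and only computes result shapes. Under these assumptions the unmodified addition does not fail but silently broadcasts to a 4-D tensor with an extra axis of size 3, while expanding x first keeps the shapes aligned.

```python
from itertools import zip_longest


def broadcast_shape(a, b):
    """Return the broadcast shape of two shapes, or raise ValueError.

    Hypothetical helper for illustration; it mirrors NumPy-style rules.
    """
    result = []
    # Compare dimensions from the trailing axis; missing leading axes count as 1.
    for da, db in zip_longest(reversed(a), reversed(b), fillvalue=1):
        if da != db and da != 1 and db != 1:
            raise ValueError(f"shapes {a} and {b} are not broadcastable")
        result.append(max(da, db))
    return tuple(reversed(result))


# Assumed shapes (not stated in the thread).
x_shape = (3, 5)            # (batch, seq_len)
mask_shape = (3, 1, 1, 5)   # (batch, 1, 1, seq_len)

# x + mask: the batch axis of x lines up with the third axis of the mask.
naive = broadcast_shape(x_shape, mask_shape)

# x[:, tf.newaxis, tf.newaxis, :] + mask: both operands are (3, 1, 1, 5).
fixed = broadcast_shape((3, 1, 1, 5), mask_shape)

print(naive)  # (3, 1, 3, 5)
print(fixed)  # (3, 1, 1, 5)
```

If the mask in your notebook version has a different shape, substitute it for mask_shape to see what the addition would produce.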

TMosh · June 25, 2021, 1:56pm

I think you don’t need to create the padding mask. Isn’t it passed to the function as an argument?
