Another improvement suggestion for C5_W4_A1_Transformer_Subclass_v1:

print(tf.keras.activations.softmax(x))
print(tf.keras.activations.softmax(x + (1 - create_padding_mask(x)) * -1.0e9))

should be

print(tf.keras.activations.softmax(x))
print(tf.keras.activations.softmax(x + (1 - tf.squeeze(create_padding_mask(x), axis=1)) * -1.0e9))

so that the mask matches the shape of x. create_padding_mask returns a mask of shape (batch_size, 1, seq_len); the extra axis is there so the mask broadcasts against the attention logits, but x has shape (batch_size, seq_len). Adding the two directly broadcasts the sum up to (batch_size, batch_size, seq_len), so the softmax runs over an unintended tensor. Squeezing out axis 1 keeps everything at (batch_size, seq_len).
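
For context, here is a minimal, self-contained sketch of the shape mismatch. It assumes create_padding_mask follows the assignment's convention (1.0 for real tokens, 0.0 for padding, with an extra broadcast axis); the sample input is illustrative, not copied from the notebook:

import tensorflow as tf

# Assumed definition, following the assignment's convention:
# 1.0 marks real tokens, 0.0 marks padding, and an extra axis is
# inserted so the mask can broadcast against attention logits.
def create_padding_mask(decoder_token_ids):
    seq = 1 - tf.cast(tf.math.equal(decoder_token_ids, 0), tf.float32)
    return seq[:, tf.newaxis, :]  # (batch_size, 1, seq_len)

# Illustrative batch of three zero-padded sequences.
x = tf.constant([[7., 6., 0., 0., 1.],
                 [1., 2., 3., 0., 0.],
                 [0., 0., 0., 4., 5.]])

mask = create_padding_mask(x)
print(x.shape)     # (3, 5)
print(mask.shape)  # (3, 1, 5)

# Without the squeeze, broadcasting silently expands the sum to
# (batch_size, batch_size, seq_len) instead of masking elementwise.
print((x + (1 - mask) * -1.0e9).shape)  # (3, 3, 5)

# With the squeeze, the mask matches x and the result stays (3, 5).
squeezed_mask = tf.squeeze(mask, axis=1)
print((x + (1 - squeezed_mask) * -1.0e9).shape)  # (3, 5)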

Thanks for the suggestion.