C5_W4_A1_Ex3_Poor_Instruction

mrgransky · July 21, 2023, 10:54am

Exercise 3 instruction is not informative at all, does not help and bring lots of frustration!

what does scale_matmul_qk really mean? isn’t it easier to say dk = tf.shape(k)[-1] # seq_len_k which btw returns error!
There were no clear explanations regarding Multiply (1. - mask) by -1e9 before applying the softmax. Why is it exactly (1. - mask)*-1e9 and not just mask*-1e9 ?
Could have been better to add tf.nn.sotfmax(..., axis=...) in the additional hints to remind using softmax from tensorflow!
Could have been better to add tf.cast(x, dtype, name=None) in the additional hints to explain why it is required to change dk’s type to ignore the InvalidArgumentError: Value for attr 'T' of int32 is not in the list of allowed values!

Cheers,

TMosh · July 21, 2023, 2:23pm

Thanks for your list of issues.

Ahusu · February 7, 2024, 9:55pm

I got stuck for more than 2 hours, when I missed the Reminder: The boolean mask parameter can be passed in as none or as either padding or look-ahead.

Multiply (1. - mask) by -1e9 before applying the softmax.

bit…

Topic		Replies	Views
C5_W4_A1 assignment Exercise 3 Sequence Models	5	424	February 8, 2024
C5 W4 A1 Ex-3 Questions (scaled_dot_product_attention) Sequence Models	6	596	October 18, 2022
I have stuck with Course 5 Week 4 Assignments1 Ex3 Sequence Models week-4	9	629	August 17, 2024
C5_W4_A1 scaled_dot_product_attention "wrong unmasked weights" Sequence Models week-4	3	57	July 10, 2024
C5 W4 A1: AssertionError: Wrong masked weights Sequence Models	6	1020	January 29, 2022

C5_W4_A1_Ex3_Poor_Instruction

Related topics