Week 4 A1 problem with scaled_dot_product_attention

The failing assert expects the attention weights to have shape (3, 4); your code is producing a different shape at that point.

That assert checks the weights against (q.shape[0], k.shape[1]), and your shapes don't match. Check how you scaled matmul_qk and where you add the mask to the scaled tensor — did you accidentally introduce an extra tuple (a stray comma or parenthesis)? See the sketch below.
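
For reference, here is a minimal sketch of the usual structure of the function, assuming TensorFlow and the convention that mask entries equal to 1 mark positions to keep (adapt the mask line to your notebook's exact instructions):

```python
import tensorflow as tf

def scaled_dot_product_attention(q, k, v, mask=None):
    """Sketch: q=(..., seq_len_q, d_k), k=(..., seq_len_k, d_k), v=(..., seq_len_k, d_v)."""
    # Q @ K^T -> (..., seq_len_q, seq_len_k)
    matmul_qk = tf.matmul(q, k, transpose_b=True)

    # Scale by sqrt(d_k), the depth of the keys
    dk = tf.cast(tf.shape(k)[-1], tf.float32)
    scaled_attention_logits = matmul_qk / tf.math.sqrt(dk)

    # Add the mask directly to the scaled logits -- no extra parentheses or
    # trailing comma here, or you build a tuple and broadcasting changes the shape.
    # (Assumes mask == 1 means "keep"; some versions use mask * -1e9 instead.)
    if mask is not None:
        scaled_attention_logits += (1.0 - mask) * -1e9

    # Softmax over the key axis -> weights of shape (seq_len_q, seq_len_k),
    # which is the (3, 4) the test expects
    attention_weights = tf.nn.softmax(scaled_attention_logits, axis=-1)

    output = tf.matmul(attention_weights, v)
    return output, attention_weights
```

If your weights come out with an extra dimension or the wrong second axis, the culprit is almost always the scaling or masking line rather than the softmax itself.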