DeepLearning.AI
Attention Output shape
Course Q&A
Deep Learning Specialization
Sequence Models
coursera-platform
TMosh
May 11, 2022, 1:47am
10
Also, I think you will be interested in the discussion at this thread:
show post in topic
Related topics
Topic
Replies
Views
Activity
Query Input Last Dimension
Sequence Models
coursera-platform
10
608
May 12, 2022
C5W4 Assignment: Multi-head attention output dimension
Sequence Models
week-module-4
,
coursera-platform
2
292
January 18, 2024
Q about keras doc of tf.keras.layers.MultiHeadAttention
Sequence Models
coursera-platform
6
565
July 18, 2021
Transformer: dimensions of encoder output and decoder Q matrix
Sequence Models
coursera-platform
1
610
April 21, 2022
W4 Assignment-Exercise6; why shape after second Add&Norm Layer is (batch_size, n_target, full_connected_dim) not (batch_size, n_target, d_model)?
Sequence Models
coursera-platform
2
449
October 12, 2023