Transformer Architecture: Fully connected Dimension Exercise 4

Hi Mentor,

In the encoder layer, Exercise 4, why does the input x have the shape (batch_size, input_seq_len, fully_connected_dim)?

Shouldn't the input x have the embedding dimension instead?

x – Tensor of shape (batch_size, input_seq_len, fully_connected_dim)

Hi @Anbu,
It’s always better if you mention the Week and Assignment numbers in your title, so that the mentors and fellow learners can easily understand and learn from your query. You can rename your title using the little pencil icon next to it.

Regards,
Elemento

Okay sir,

This is the Transformer architecture programming assignment, Exercise 4.

Hi @Anbu,
You don’t need to refer to me as “Sir”. I am also a fellow learner, just like you :smile:. As for your query, I have requested the other mentors to take a look at it once, as I am not very well-versed with Transformers.

Regards,
Elemento

Hi Anbu,

Yes, the docstring should say (batch_size, input_seq_len, embedding_dim). Thanks for noticing!
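
To see why the docstring was misleading: fully_connected_dim only appears *inside* the point-wise feed-forward sublayer, while the encoder layer's input and output keep the embedding dimension. Here is a minimal NumPy sketch of just the shape flow, with hypothetical dimension values chosen for illustration (not the assignment's actual values):

```python
import numpy as np

# Hypothetical dimensions for illustration only
batch_size, input_seq_len = 2, 5
embedding_dim, fully_connected_dim = 8, 32

rng = np.random.default_rng(0)

# Encoder-layer input x: shape (batch_size, input_seq_len, embedding_dim)
x = rng.standard_normal((batch_size, input_seq_len, embedding_dim))

# Point-wise feed-forward sublayer: expand to fully_connected_dim,
# then project back down to embedding_dim
W1 = rng.standard_normal((embedding_dim, fully_connected_dim))
W2 = rng.standard_normal((fully_connected_dim, embedding_dim))

hidden = np.maximum(x @ W1, 0)  # ReLU; (batch_size, input_seq_len, fully_connected_dim)
out = hidden @ W2               # back to (batch_size, input_seq_len, embedding_dim)

print(hidden.shape)  # (2, 5, 32) -- fully_connected_dim lives only here
print(out.shape)     # (2, 5, 8)  -- same shape as the input x
```

So the encoder layer consumes and produces tensors of shape (batch_size, input_seq_len, embedding_dim); the wider fully_connected_dim is an internal detail of the feed-forward block.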