In the attached picture, how do we repeat the encoder and decoder N times?
Are the N repetitions of the encoder and decoder applied in parallel or sequentially? And what is the input and output of each stack?
What is the video title and what is the time mark?
The video name is “Transformer Network”.
At two points, “N” repetition has been mentioned.
- at 1:45
- at 3:57
Sorry, I’m not able to access any of the Week 4 materials right now. I don’t understand why.
Hmmm, I just checked and I can see the lectures. I haven’t watched them in a while, so I’ll need to go through them again, but I won’t be able to do that until later today because of other commitments.
If I had to guess from memory, the N probably refers to the timesteps in the input. One of the big differences that makes Attention more powerful than the classic RNN/GRU/LSTM is that it handles the timesteps in parallel. In the typical case with a sentence as input, "timesteps" map to "words", of course.
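To illustrate the "timesteps in parallel" point: in scaled dot-product attention, all positions are compared against all other positions in a single matrix multiply, with no step-by-step recurrence. This is a minimal NumPy sketch of that idea (single head, no masking or projections):

```python
import numpy as np

def scaled_dot_product_attention(q, k, v):
    # All timesteps are processed at once via matrix multiplies,
    # unlike an RNN/GRU/LSTM, which must walk through positions
    # one at a time.
    d_k = q.shape[-1]
    scores = q @ k.T / np.sqrt(d_k)                          # (seq_len, seq_len)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights = weights / weights.sum(axis=-1, keepdims=True)  # row-wise softmax
    return weights @ v                                       # (seq_len, d_k)

seq_len, d_k = 5, 8
q = k = v = np.random.randn(seq_len, d_k)
out = scaled_dot_product_attention(q, k, v)
print(out.shape)  # (5, 8): one output vector per timestep, computed jointly
```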
Thanks for your reply.
The N refers to the number of repetitions of the encoder and decoder blocks in the transformer structure.
The thing I don't get is how they are repeated: is it sequential or parallel, and in either case, what exactly is the input and output of each block?
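For what it's worth, in the original "Attention Is All You Need" architecture the N encoder blocks are stacked sequentially: each block's output (a sequence of vectors of shape (seq_len, d_model)) is the next block's input, and the last encoder block's output feeds the decoder's cross-attention. A rough sketch, with `encoder_block` as a placeholder stand-in for the real self-attention + feed-forward sublayers:

```python
import numpy as np

def encoder_block(x):
    # Placeholder: a real block applies self-attention and a
    # feed-forward network, each with a residual connection and
    # layer norm. What matters here is that it preserves the
    # (seq_len, d_model) shape, so blocks can be chained.
    return x

def encoder_stack(x, n=6):
    # The N blocks run sequentially: each block consumes the
    # previous block's output. They are not N parallel copies.
    for _ in range(n):
        x = encoder_block(x)
    return x

x = np.zeros((10, 512))      # 10 input tokens, d_model = 512
out = encoder_stack(x, n=6)  # N = 6 in the original paper
print(out.shape)             # (10, 512): shape is preserved through the stack
```

So the parallelism in a transformer is across the timesteps *within* each block; the N blocks themselves form a sequential pipeline.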