About encoder structure in C4_W3_Assignment

Richard_Tsai · June 27, 2023, 7:25am

In the notebook image the structure is depicted as Attention layer then add residual and then layer norm. However, in the coding part it is structured as :
# addResidual layer tl.Residual( # add norm layer tl.LayerNorm(), # add attention attention, # add dropout dropout_, ),

Why is that?

arvyzukai · June 27, 2023, 8:07am

Hi @Richard_Tsai

That is a good question The short answer is that the picture is from the original Attention Is All You Need paper while the Assignment implementation is a newer (better) version of it.

The image is taken from here and you can read more about it if you’re interested.

Cheers

Topic		Replies	Views
Layer order in Residual block of UNQ_C6 NLP with Attention Models week-module-2	3	615	April 11, 2023
Encoder Block ResNet Sequence Models coursera-platform	1	519	May 7, 2022
Questions regarding course 4 week 1 NLP with Attention Models week-module-1	1	585	August 3, 2022
Error in C4W2- Exercise 2 NLP with Attention Models week-module-2	2	66	August 19, 2024
Residual Layer in Assignment NLP with Attention Models week-module-1	1	525	August 2, 2022

About encoder structure in C4_W3_Assignment

Related topics