Explanation of Feedforward network in encoder and decoder

bshree · December 3, 2024, 5:37pm

Question: In the "Attention Is All You Need" lesson, the use of the feed-forward network in the encoder and decoder modules is not quite clear. It would be helpful if someone could explain it clearly. I did not understand the explanation given in the reading material either.

Link: https://www.coursera.org/learn/generative-ai-with-llms/lecture/R0xbD/generating-text-with-transformers

Girijesh · July 30, 2025, 4:54pm

Dear @bshree,

May I confirm if your concern has been resolved, or if you still require assistance?

–
Keep Learning AI with DeepLearning.AI - Girijesh

dhawalkapil · July 30, 2025, 8:09pm

Feedforward Neural Networks (FNNs) are one of the most common neural network types. Information flows in a single direction: the input layer receives the data, the hidden layers transform it into more complex representations, and the output layer produces the final output.

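In the transformer architecture, every encoder and decoder layer contains an FNN that sits right after the attention sub-layer. Attention mixes information across tokens; the FNN then transforms each token's vector on its own, using the same weights at every position, which is where the layer learns more complex representations. The encoder builds a contextual understanding of the user’s prompt, while the decoder takes the encoder's output representations together with the tokens generated so far and is trained to predict the next token.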
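If it helps, here is a minimal sketch of that feed-forward step in plain Python, following the formula from the "Attention Is All You Need" paper, FFN(x) = max(0, x·W1 + b1)·W2 + b2. The sizes (`d_model = 4`, `d_ff = 16`), the random weights, and the helper names are assumptions chosen for illustration; the paper uses 512 and 2048, and real models learn the weights. The residual connection and layer normalization around the block are left out to keep it short.

```python
import random

def relu(vec):
    # Element-wise max(0, v): the non-linearity between the two linear layers.
    return [max(0.0, v) for v in vec]

def linear(vec, weights, bias):
    # weights holds one row per output unit; each row has len(vec) entries.
    return [sum(w * v for w, v in zip(row, vec)) + b
            for row, b in zip(weights, bias)]

def feed_forward(token_vec, w1, b1, w2, b2):
    hidden = relu(linear(token_vec, w1, b1))  # expand: d_model -> d_ff
    return linear(hidden, w2, b2)             # project back: d_ff -> d_model

def random_matrix(rows, cols, rng):
    # Hypothetical helper: stands in for learned weights.
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
d_model, d_ff = 4, 16  # illustrative sizes only
w1, b1 = random_matrix(d_ff, d_model, rng), [0.0] * d_ff
w2, b2 = random_matrix(d_model, d_ff, rng), [0.0] * d_model

# Pretend these are the attention outputs for a 3-token sequence.
sequence = [[rng.uniform(-1.0, 1.0) for _ in range(d_model)] for _ in range(3)]

# The same weights are applied to every position, one token at a time.
outputs = [feed_forward(tok, w1, b1, w2, b2) for tok in sequence]

for position, vec in enumerate(outputs):
    print(position, [round(v, 3) for v in vec])
```

Note that `feed_forward` never looks at neighbouring tokens: changing one token's vector only changes that token's output. All of the mixing between tokens happens in the attention sub-layer that comes before it.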

To understand how these neural networks work in more detail, I would advise going through the neural-networks-deep-learning course.

Happy learning!
