In the Attention Is All You Need lesson, the use of the feed-forward network in the encoder and decoder modules is not quite clear. It would be helpful if someone could explain it clearly.

If you take the NLP Specialization, it is explained in detail!
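In the meantime, the basic idea: after the attention sublayer mixes information across positions, each position is passed independently through the same small two-layer network, FFN(x) = max(0, xW1 + b1)W2 + b2. Below is a minimal PyTorch sketch of that position-wise FFN; the formula and the default sizes (d_model=512, d_ff=2048) come from the paper, but the module itself is just an illustration, not code from the course:

```python
import torch
import torch.nn as nn

class PositionWiseFFN(nn.Module):
    """FFN(x) = max(0, x W1 + b1) W2 + b2, applied at every position."""
    def __init__(self, d_model=512, d_ff=2048):
        super().__init__()
        self.linear1 = nn.Linear(d_model, d_ff)  # expand to the inner dimension
        self.linear2 = nn.Linear(d_ff, d_model)  # project back to the model dimension
        self.relu = nn.ReLU()

    def forward(self, x):  # x: (batch, seq_len, d_model)
        # The same two layers are applied at each position independently;
        # mixing across positions happens only in the attention sublayer.
        return self.linear2(self.relu(self.linear1(x)))

ffn = PositionWiseFFN()
out = ffn(torch.randn(2, 10, 512))  # output shape: (2, 10, 512)
```

Because nn.Linear acts only on the last dimension, no information flows between positions here; the FFN's job is to transform each token's representation on its own after attention has gathered context.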


I browsed through Course 4, "Natural Language Processing with Attention Models", which seemed the most likely to tackle this question, but based on the video titles I am not sure it is discussed there. Could you provide the name of the course where the role of the FFN in the Transformer is discussed?

It should be the course with Attention Models; I think it's the last course in the specialization!