In attention is all you need lesson, the use of feed forward network in the encoder and decoder module is not quiet clear. It will be helpful if someone can explain it clearly

If you take the NLP Specialization, it is explained in detail!

1 Like