In this week’s assignment, the Transformer.call function calls the encoder once on the input sentence and the decoder once on the target output.
This makes sense to me for training, since the attention matrix can be computed all at once given the target output and the look-ahead mask.
My question is: when performing translation (say, translating an English sentence into German) with a trained Transformer, do we also call Transformer.call only once? My understanding is that we generate word by word. To begin with, we pass an output containing only the start token into the decoder and generate the first word; then we pass the updated output into the decoder again to generate the second word, and so on until EOS is generated. Is this correct?
If so, don’t we need to call the encoder once to encode the original sentence, and the decoder multiple times to generate a sentence with multiple words? This logic doesn’t seem to be implemented by any method of the Transformer model.
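To make the question concrete, the word-by-word inference loop I have in mind would look roughly like the sketch below. Everything here is a hypothetical placeholder, not the assignment’s API: `encode` stands in for one encoder pass, `decode_step` stands in for one decoder pass that returns the most probable next token id, and `EOS = 0` is an assumed end-of-sequence id.

```python
EOS = 0  # assumed end-of-sequence token id (placeholder)

def encode(input_ids):
    # Stub encoder: a real model would return contextual
    # representations of the source sentence here.
    return list(input_ids)

def decode_step(enc_output, output_ids):
    # Stub decoder: a real model would run the decoder over the
    # tokens generated so far (with a look-ahead mask) and return
    # the id of the most probable next token. This stub just
    # echoes the source tokens, then emits EOS.
    pos = len(output_ids) - 1  # tokens generated after the start token
    return enc_output[pos] if pos < len(enc_output) else EOS

def translate(input_ids, start_id, max_len=20):
    enc_output = encode(input_ids)   # encoder runs ONCE
    output_ids = [start_id]
    for _ in range(max_len):         # decoder runs once PER token
        next_id = decode_step(enc_output, output_ids)
        output_ids.append(next_id)
        if next_id == EOS:
            break
    return output_ids

print(translate([5, 7, 9], start_id=1))  # → [1, 5, 7, 9, 0]
```

The point of the sketch is that the encoder output is computed once and reused on every decoder step, so a loop like this typically lives in a separate inference function rather than inside Transformer.call itself.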