I am kind of confused about the arguments here. For the transformer exercise, there is an argument called output_sentence, but I don’t know where to put it. Is it part of the decoder? We already have an input_sentence, which is supposed to be the same representation as the output_sentence, or did I miss something?
“input_sentence” is the source to be translated, and it is fed into the Encoder. In this picture, it is the French sentence.
“output_sentence” is the ground-truth sentence used for training in the Decoder. In this picture, it is the English sentence.
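To make the two roles concrete, here is a minimal sketch of a training step with teacher forcing. It uses PyTorch’s nn.Transformer; the shapes, vocabulary size, and the shift-by-one are illustrative assumptions, not the exercise’s exact API:

```python
import torch
import torch.nn as nn

vocab = 100
embed = nn.Embedding(vocab, 32)
proj = nn.Linear(32, vocab)
model = nn.Transformer(d_model=32, nhead=4, batch_first=True)

# Hypothetical token ids: a French source and an English target.
input_sentence = torch.randint(0, vocab, (1, 7))   # fed to the Encoder
output_sentence = torch.randint(0, vocab, (1, 6))  # ground truth for the Decoder

# Teacher forcing: the Decoder sees the ground truth shifted right,
# and the loss compares its predictions with the ground truth shifted
# left, so the token at position t is predicted from positions < t.
decoder_input = output_sentence[:, :-1]
labels = output_sentence[:, 1:]

# Causal mask so position t cannot attend to future positions.
mask = nn.Transformer.generate_square_subsequent_mask(decoder_input.size(1))

out = model(embed(input_sentence), embed(decoder_input), tgt_mask=mask)
loss = nn.functional.cross_entropy(proj(out).transpose(1, 2), labels)
```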
In the Decoder, an overview of the process flow is (see the sketch after this list):
1. Create self-attention over “output_sentence”, followed by normalization.
2. Calculate cross-attention: the query (Q) comes from step 1, while the key (K) and value (V) are “enc_output”, the output of the Encoder; the attention weights measure the similarity between Q and K.
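Here is a minimal sketch of those two steps inside one Decoder layer, assuming PyTorch’s nn.MultiheadAttention; the shapes and the causal mask are illustrative assumptions:

```python
import torch
import torch.nn as nn

d_model, nhead = 32, 4
self_attn = nn.MultiheadAttention(d_model, nhead, batch_first=True)
cross_attn = nn.MultiheadAttention(d_model, nhead, batch_first=True)
norm1, norm2 = nn.LayerNorm(d_model), nn.LayerNorm(d_model)

dec_in = torch.randn(1, 6, d_model)      # embedded output_sentence
enc_output = torch.randn(1, 7, d_model)  # output of the Encoder

# Step 1: masked self-attention over output_sentence, then normalization.
causal = nn.Transformer.generate_square_subsequent_mask(dec_in.size(1))
x, _ = self_attn(dec_in, dec_in, dec_in, attn_mask=causal)
x = norm1(dec_in + x)

# Step 2: cross-attention -- Q comes from step 1, K and V from enc_output.
y, _ = cross_attn(x, enc_output, enc_output)
y = norm2(x + y)
```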
One addition about the behavior at prediction time. At training time, we feed the ground truth to the Decoder. At prediction (inference) time, output_sentence is actually the output from the Decoder itself. When the Decoder predicts the translated word at time t, it refers to the sequence of words it has already translated up to time t-1. In other words, at time t, output_sentence is the Decoder’s own output sequence from time t-1.
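As an illustration, a greedy decoding loop could look like the sketch below; `model`, `embed`, and `proj` are the hypothetical objects from the training sketch above, and `start_id` / `end_id` are assumed special-token ids:

```python
import torch
import torch.nn as nn

def greedy_decode(model, embed, proj, input_sentence, start_id, end_id, max_len=20):
    # At inference time the Decoder is fed its own previous outputs:
    # the sequence generated up to time t-1 becomes its input at time t.
    generated = torch.tensor([[start_id]])
    src_emb = embed(input_sentence)
    for _ in range(max_len):
        mask = nn.Transformer.generate_square_subsequent_mask(generated.size(1))
        out = model(src_emb, embed(generated), tgt_mask=mask)
        next_id = proj(out[:, -1]).argmax(-1, keepdim=True)  # most likely next word
        generated = torch.cat([generated, next_id], dim=1)
        if next_id.item() == end_id:
            break
    return generated
```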
Again, this is for prediction time. Hope this clarifies the behavior at training time and at prediction time.