Question about the sentinels

arvyzukai · November 16, 2023, 11:10am

Hi @PZ2004

I don’t quite understand you question. Please read the last paragraph of this post and tell if you still have questions. In short, in T5:

Self-supervised training uses corrupted tokens, by randomly removing 15% of the tokens and replacing them with individual sentinel tokens (if several consecutive tokens are marked for removal, the whole group is replaced with a single sentinel token). The input of the encoder is the corrupted sentence, the input of the decoder is the original sentence and the target is then the dropped out tokens delimited by their sentinel tokens.

While in our assignment we do not implement the decoder part (and also other T5 tasks) so the assignment’s encoder has to predict the sentinels correctly.

Cheers

P.S. use “\” to escape the “<” symbols (that what makes your text bold or striked out and it is hard to read).

Topic		Replies	Views
C4_W3, regarding pretty_decode() NLP with Attention Models course-related , week-3	3	173	October 8, 2024
Why this code show up in C4W3 graded function NLP with Attention Models week-3	12	38	August 17, 2024
C4W3_Assignment Sentinel Index Conflicts NLP with Attention Models week-3	2	44	July 8, 2024
The tokens that decoder block use Sequence Models week-4	3	205	April 15, 2024
C4W1_Neural Machine Translation_Exercise 5 - translate NLP with Attention Models week-1	11	58	November 9, 2024

Question about the sentinels

Related topics