W4A1 target_sequence_length?

In the decoder part of the transformer model, I see that target_sequence_length is a fixed value. Why is it fixed, given that we usually don't know in advance how many tokens the target will end up being, especially in machine translation? Thanks!

This was discussed in the earlier "Masking" section of the notebook. Please have a look at the instructions there. E.g., here's one key sentence from that section:

When passing sequences into a transformer model, it is important that they are of uniform length. You can achieve this by padding the sequence with zeros, and truncating sentences that exceed the maximum length of your model:
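For reference, here is a minimal sketch of that padding/truncation step (not the notebook's exact code), assuming the Keras `pad_sequences` utility and a hypothetical fixed length `MAX_LENGTH`:

```python
import tensorflow as tf

MAX_LENGTH = 10  # hypothetical fixed target_sequence_length

# Token-ID sequences of different lengths (e.g., tokenized target sentences)
sequences = [
    [12, 7, 345, 2],
    [12, 7, 345, 99, 4, 18, 2],
    list(range(1, 15)),  # longer than MAX_LENGTH, so it gets truncated
]

padded = tf.keras.preprocessing.sequence.pad_sequences(
    sequences,
    maxlen=MAX_LENGTH,
    padding="post",     # append zeros after the real tokens
    truncating="post",  # drop tokens beyond MAX_LENGTH
    value=0,            # 0 serves as the padding token ID
)

print(padded.shape)  # (3, 10) -- every sequence now has the same length
```

After this step every target sequence in a batch has the same length, which is what lets the decoder work with fixed-size tensors; the padding mask (also covered in the "Masking" section) then tells the model to ignore the zero positions.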