ReformerLM output length limit

In the ReformerLM_output_gen function, output_gen = trax.supervised.decoding.autoregressive_sample_stream(…) does not seem to have any limit on the number of tokens generated. Will it stop automatically once it reaches the maximum sequence window size in each batch? If so, which boundary in the following setting is chosen?

trax.data.BucketByLength(boundaries=[128, 256, 512, 1024],
                         batch_sizes=[16, 8, 4, 2, 1]),
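For context on the setting above: BucketByLength pads each sequence up to the smallest boundary it fits under and batches it with the corresponding batch size; sequences longer than the last boundary go into a final overflow bucket, which is why there is one more batch size than there are boundaries. Here is a minimal pure-Python sketch of that selection rule; `bucket_for_length` is a hypothetical helper written for illustration, not a trax API:

```python
def bucket_for_length(length, boundaries, batch_sizes):
    """Pick the padded length and batch size for one sequence,
    mimicking the usual length-bucketing rule (illustrative only)."""
    for boundary, batch_size in zip(boundaries, batch_sizes):
        if length <= boundary:
            return boundary, batch_size
    # Longer than every boundary: overflow bucket, last batch size.
    return length, batch_sizes[-1]

boundaries = [128, 256, 512, 1024]
batch_sizes = [16, 8, 4, 2, 1]

print(bucket_for_length(100, boundaries, batch_sizes))   # (128, 16)
print(bucket_for_length(300, boundaries, batch_sizes))   # (512, 4)
print(bucket_for_length(2000, boundaries, batch_sizes))  # (2000, 1)
```

So a sequence of length 300, for example, is padded to 512 and batched 4 at a time under this configuration.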

Hello @YIHUI!

As the docs of autoregressive_sample_stream say:
“Inputs and outputs always come in batches, even if size 1. If inputs is present, it must have shape (batch_size, inputs_sequence_length), and each output in the stream has shape (batch_size, 1).”

So, in this case you need to create input tokens using the tokenize function; these become your inputs for autoregressive_sample_stream, with shape (1, n) and batch_size=1. Also note that autoregressive_sample_stream is a generator that yields one token per step and does not stop on its own, so the caller is responsible for capping the number of steps (or breaking on an end-of-sequence token). Does this make sense to you?
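The caller-side capping pattern can be sketched in pure Python. The generator below is a stand-in for autoregressive_sample_stream (the real one yields arrays of shape (batch_size, 1), not ints), and MAX_LEN and EOS_ID are hypothetical values chosen for illustration; trax also ships autoregressive_sample, which handles max_length and eos_id for you:

```python
from itertools import islice

def fake_sample_stream():
    """Stand-in for autoregressive_sample_stream: yields one
    'token' per step, forever, until the caller stops iterating."""
    token = 0
    while True:
        yield token
        token += 1

MAX_LEN = 10  # hypothetical hard cap chosen by the caller
EOS_ID = 7    # hypothetical end-of-sequence token id

output = []
for tok in islice(fake_sample_stream(), MAX_LEN):
    if tok == EOS_ID:  # stop early at end-of-sequence
        break
    output.append(tok)

print(output)  # → [0, 1, 2, 3, 4, 5, 6]
```

Without the islice cap (or the EOS break), the loop would run indefinitely, which is the behavior the question was asking about.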

Best regards,
Wesley P.