Attention mask and pad token id

The attention mask and the pad token id were not set. As a consequence, you may observe unexpected behavior. Please pass your input’s attention_mask to obtain reliable results. Setting pad_token_id to eos_token_id:0 for open-end generation.

Whenever I run inference, I get the warning above. Can anyone explain what it means and how to fix it?

Hey there, I was facing the same warning. After tinkering with the tokenizer for a bit, I found that inputs = tokenizer.encode(prompt) returns only the input_ids, not the attention mask, whereas inputs = tokenizer(prompt) returns both the input_ids and the attention mask.
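
You can see the difference for yourself. A quick sketch, assuming a GPT-2 tokenizer (any Hugging Face tokenizer behaves the same way):

from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")

# encode() returns only a list of token ids
print(tokenizer.encode("Hello world"))
# e.g. [15496, 995] (exact ids depend on the tokenizer)

# Calling the tokenizer directly returns ids AND the attention mask
print(tokenizer("Hello world"))
# e.g. {'input_ids': [15496, 995], 'attention_mask': [1, 1]}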

So if you replace this code

inputs = tokenizer.encode(prompt, return_tensors="pt")  # only input_ids, no attention mask
output = model.generate(inputs)

With this

inputs = tokenizer(prompt, return_tensors="pt")  # input_ids and attention_mask
output = model.generate(**inputs)

The attention-mask part of the warning goes away. Hope that helps :blush:.
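
To silence the pad_token_id part of the warning as well, you can pass it explicitly to generate(). Here is a minimal end-to-end sketch, assuming a GPT-2 checkpoint (swap in your own model name; GPT-2 has no pad token, so reusing the eos token is the usual workaround the warning itself hints at):

from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # assumption: replace with your checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

prompt = "Once upon a time"
# return_tensors="pt" gives PyTorch tensors, which generate() expects
inputs = tokenizer(prompt, return_tensors="pt")

output = model.generate(
    **inputs,
    pad_token_id=tokenizer.eos_token_id,  # avoids the pad_token_id warning
    max_new_tokens=50,
)
print(tokenizer.decode(output[0], skip_special_tokens=True))

With both the attention mask passed in and pad_token_id set, the warning should disappear entirely.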