Hi all,
I needed to change this Llama 3.2 image reasoning example to ask the question "what is the capital of USA".
Below is the image example that I got from the HF model card.
I solved it using this code:
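For context, the snippet further down assumes the model, processor, and image have already been loaded as in that model card example. A minimal sketch of such a setup (the model id follows the HF model card for meta-llama/Llama-3.2-11B-Vision-Instruct; the image URL is only a placeholder, not from the original post):

import requests
import torch
from PIL import Image
from transformers import MllamaForConditionalGeneration, AutoProcessor

model_id = "meta-llama/Llama-3.2-11B-Vision-Instruct"
model = MllamaForConditionalGeneration.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
processor = AutoProcessor.from_pretrained(model_id)

# Placeholder image; any PIL image works here
url = "https://example.com/some_image.jpg"
image = Image.open(requests.get(url, stream=True).raw)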
# Build the chat message: one image placeholder plus the text question
messages = [
    {"role": "user", "content": [
        {"type": "image"},
        {"type": "text", "text": "what is written in the image: "}
    ]}
]
# Render the chat template into a prompt string, then prepare model inputs
input_text = processor.apply_chat_template(messages, add_generation_prompt=True)
inputs = processor(
    image,
    input_text,
    add_special_tokens=False,
    return_tensors="pt"
).to(model.device)
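From there, a minimal sketch of running generation and decoding the reply (max_new_tokens is just an illustrative value); to ask a different question such as "what is the capital of USA", only the "text" field in messages needs to change:

# Generate the reply and decode only the newly generated tokens
output = model.generate(**inputs, max_new_tokens=64)
generated = output[0][inputs["input_ids"].shape[-1]:]
print(processor.decode(generated, skip_special_tokens=True))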