Is there a way to receive the rest of the answer after the max token limit?

Hello, I am currently using the Claude 3 model.
Claude 3's max token limit is 4096 tokens.

In other words, Claude 3 can produce at most 4096 tokens in a single response.

I asked Claude 3 to translate a huge set of documents, but because of the 4096-token limit, the model does not output the full translation.

How can I continue receiving the answer after the max token limit is reached?
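
To make the problem concrete, here is roughly what my call looks like (a minimal sketch using the Anthropic Python SDK; the model name and prompt are placeholders):

```python
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

response = client.messages.create(
    model="claude-3-opus-20240229",  # illustrative model name
    max_tokens=4096,                 # hard cap on the length of one response
    messages=[{"role": "user", "content": "Translate this document: ..."}],
)

# When the cap is hit, the translation is cut off partway through:
print(response.stop_reason)      # "max_tokens" instead of "end_turn"
print(response.content[0].text)  # only the first part of the translation
```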

I think you cannot.


Isn’t the answer RAG?

You can’t pass a large amount of data to OpenAI/Gemini/etc. in a single request, so you save the data in a vector DB and use LangChain/LlamaIndex to query it.

Isn’t this the reason RAG exists?

Thanks in advance.

But how? Is that even possible?

How would saving data in a vector DB let me get the rest of a response that was cut off at the max token limit?

“I asked Claude 3 to translate a huge set of documents, but because of the 4096-token limit, the model does not output the full translation.”

Are you saying that I can save the model’s truncated output to a vector DB and then keep receiving the rest of the answer?

You have to upload the huge set of documents as vectors to a vector DB like Pinecone. Then build a RAG pipeline (a fancy name for vector searching) on top of it, and you should be able to overcome the issue of not being able to pass all of your data to the model.
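
For example, something along these lines (a rough sketch, assuming the Pinecone and OpenAI Python SDKs; the index name, key, chunks, and question are all placeholders):

```python
from openai import OpenAI
from pinecone import Pinecone

openai_client = OpenAI()                    # reads OPENAI_API_KEY from the environment
pc = Pinecone(api_key="YOUR_PINECONE_KEY")  # placeholder key
index = pc.Index("my-documents")            # hypothetical pre-created index

def embed(text: str) -> list[float]:
    # Turn a chunk of text into an embedding vector.
    resp = openai_client.embeddings.create(model="text-embedding-3-small", input=text)
    return resp.data[0].embedding

# 1. Split the documents into chunks and upsert each chunk as a vector.
chunks = ["first chunk of a document...", "second chunk..."]  # placeholder chunks
index.upsert(vectors=[
    {"id": f"chunk-{i}", "values": embed(c), "metadata": {"text": c}}
    for i, c in enumerate(chunks)
])

# 2. At question time, retrieve only the most relevant chunks.
question = "What does the document say about pricing?"  # placeholder question
results = index.query(vector=embed(question), top_k=3, include_metadata=True)
context = "\n\n".join(m.metadata["text"] for m in results.matches)

# 3. Pass just that small context to the model instead of everything,
#    staying well inside the limits.
print(context)
```

Note that this mainly helps on the input side, since you no longer stuff everything into one prompt. The 4096-token cap on a single output still applies, so for a full translation you would also work through the documents chunk by chunk.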