Handling large number of tokens

balaji.ambresh · April 29, 2023, 10:39am

Generating a running summary, as pointed out by @Eric_Townes, is a good place to start. Here are two more approaches that may not require summarization:

If response quality must stay high, switch to a bigger model (one with a larger token limit) once the current model's limit is reached. Once you are on the largest model your company can pay for and still hit the limit, get a human involved.
If the earlier parts of the conversation are unimportant, remove them from the chat history to free up space for more recent messages.
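
Both approaches can be sketched in a few lines of Python. This is only a rough illustration under some assumptions: the model names and token limits in `MODEL_TIERS` are hypothetical placeholders, messages are assumed to be dicts with `role` and `content` keys, and `rough_token_count` is a crude whitespace-based stand-in for a real tokenizer, so swap in your model's tokenizer for real counts.

```python
from typing import Callable, Dict, List, Optional, Tuple

Message = Dict[str, str]  # assumed shape: {"role": ..., "content": ...}


def rough_token_count(message: Message) -> int:
    # Crude stand-in for a real tokenizer: one "token" per whitespace-separated word.
    return len(message["content"].split())


def trim_history(
    messages: List[Message],
    max_tokens: int,
    count_tokens: Callable[[Message], int] = rough_token_count,
) -> List[Message]:
    """Keep system messages plus the most recent messages that fit in max_tokens."""
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]
    budget = max_tokens - sum(count_tokens(m) for m in system)

    kept: List[Message] = []
    # Walk from newest to oldest and stop at the first message that no longer fits.
    for message in reversed(rest):
        cost = count_tokens(message)
        if cost > budget:
            break
        kept.append(message)
        budget -= cost

    kept.reverse()  # restore chronological order
    return system + kept


# Hypothetical tiers, ordered from smallest to largest token limit.
MODEL_TIERS: List[Tuple[str, int]] = [("small-model", 4000), ("large-model", 32000)]


def pick_model(
    total_tokens: int, tiers: List[Tuple[str, int]] = MODEL_TIERS
) -> Optional[str]:
    """Return the first tier whose limit fits, or None to signal a human handoff."""
    for name, limit in tiers:
        if total_tokens <= limit:
            return name
    return None
```

A caller might first try `pick_model` on the full conversation, and fall back to `trim_history` (or to a human) only when it returns `None`. Note that this sketch moves all system messages to the front, which assumes they only appear at the start of the conversation.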
