LangChain for LLM application development. Memory

Sally_Muhammad · November 15, 2023, 10:17am

Why do we specify the maximum number of tokens in ConversationSummaryBufferMemory()?
Does it control how much the past conversation is condensed when it is summarized, or is it there for another reason?

balaji.ambresh · November 15, 2023, 11:27am

Your guess is correct. See this notebook

Sally_Muhammad · November 15, 2023, 11:44am

I think I got confused, so let me check whether what I understood from the code is right.
At the very beginning, the conversation was so short that it did not reach the specified maximum number of tokens, so the model did not have to do any summarization. However, as the chat proceeds, the past conversation starts to exceed the maximum number of tokens, so the model starts summarizing messages from the beginning until the number of tokens is back to <= the maximum. For example, if we have 5 past messages and summarizing the first 2 brings the token count back under the limit, the model won't summarize the remaining 3 and will return them as they are. Is this right?

balaji.ambresh · November 15, 2023, 12:32pm

Right again. Instead of blindly discarding messages from the past, a summary is extracted to ensure that we retain the important information about the messages to be discarded. See this as well.
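
To make the pruning step concrete, here is a minimal plain-Python sketch of the idea. It is an illustration only, not LangChain's actual code: `count_tokens` (a whitespace split) and `summarize` (a stub that keeps the first few words of each dropped message) are hypothetical stand-ins for the real tokenizer and the LLM summarization call.

```python
from typing import List, Tuple


def count_tokens(text: str) -> int:
    # Assumption: whitespace-separated words stand in for real model tokens.
    return len(text.split())


def summarize(previous_summary: str, dropped: List[str]) -> str:
    # Assumption: a real memory would ask an LLM to fold `dropped` into the
    # running summary. This stub just keeps the first three words of each.
    parts = [previous_summary] if previous_summary else []
    parts += [" ".join(m.split()[:3]) + "..." for m in dropped]
    return " | ".join(parts)


def prune(summary: str, buffer: List[str], max_token_limit: int) -> Tuple[str, List[str]]:
    """Move the oldest messages into the summary until the buffer fits the limit."""
    buffer = list(buffer)  # work on a copy so the caller's list is untouched
    dropped: List[str] = []
    while buffer and sum(count_tokens(m) for m in buffer) > max_token_limit:
        dropped.append(buffer.pop(0))  # oldest message first
    if dropped:
        summary = summarize(summary, dropped)
    return summary, buffer


# Five past messages with 4 + 4 + 3 + 2 + 2 = 15 "tokens" in total.
messages = [
    "hi there my friend",
    "hello how are you",
    "tell me more",
    "about memory",
    "in LangChain",
]

summary, kept = prune("", messages, max_token_limit=8)
print(summary)  # condensed stand-in for the first two messages
print(kept)     # the last three messages, returned verbatim
```

With a limit of 8, dropping the first two messages brings the buffer down to 7 tokens, so the remaining three are left untouched. With a limit larger than 15, nothing is summarized at all.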

See ConversationBufferWindowMemory to understand why ConversationSummaryBufferMemory was introduced.
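
For contrast, a window memory keeps only the last k exchanges and forgets everything older, with no summary left behind. A minimal sketch of that behaviour, assuming each exchange is a (human, ai) pair; the `window` helper and the sample history are hypothetical, not LangChain's API:

```python
from typing import List, Tuple

Exchange = Tuple[str, str]  # (human message, AI reply)


def window(history: List[Exchange], k: int) -> List[Exchange]:
    # Keep only the k most recent exchanges; older ones are dropped outright.
    return history[-k:] if k > 0 else []


history = [
    ("My name is Sam.", "Nice to meet you, Sam."),
    ("What is 1 + 1?", "2."),
    ("What is my name?", "..."),
]

recent = window(history, k=1)
print(recent)  # only the latest exchange; the name from the first one is gone
```

Once the first exchange falls outside the window, nothing about it is retained, which is the gap a summary-based memory is meant to fill.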

Sally_Muhammad · November 15, 2023, 1:15pm

Thank you so much
