Ability to Summarize an LLM Conversation #216

Merged
brainlid merged 9 commits into main from me-summarize-conversation on Dec 15, 2024

Conversation


@brainlid (Owner) commented on Dec 15, 2024

Adds LangChain.Chains.SummarizeConversationChain.

The purpose is to allow for long user/assistant conversations while trying to keep the context window and the total number of input tokens under control.

This takes a long LLMChain conversation (with numerous user and assistant back-and-forth messages) and combines the message contents into a single message so an LLM can summarize the conversation up to a certain point.
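As a rough sketch of the intended flow (the function names SummarizeConversationChain.new!/1 and summarize/2 and the exact option keys are assumptions here, so check the module docs for the real API; many_back_and_forth_messages is a placeholder for an existing message history):

    alias LangChain.ChatModels.ChatOpenAI
    alias LangChain.Chains.{LLMChain, SummarizeConversationChain}
    alias LangChain.Message

    # An existing conversation chain with a long user/assistant history.
    # `many_back_and_forth_messages` stands in for that accumulated history.
    chain =
      LLMChain.new!(%{llm: ChatOpenAI.new!(%{model: "gpt-4o"})})
      |> LLMChain.add_message(Message.new_system!("You are a helpful assistant."))
      |> LLMChain.add_messages(many_back_and_forth_messages)

    # A separate chain that performs the summarizing; it can use a cheaper model.
    summarizer =
      SummarizeConversationChain.new!(%{
        llm: ChatOpenAI.new!(%{model: "gpt-4o-mini", stream: false}),
        threshold_count: 30,
        keep_count: 2
      })

    # Returns an LLMChain: the original one if it is under the threshold,
    # otherwise one where older messages are collapsed into a single summary.
    shortened_chain = SummarizeConversationChain.summarize(summarizer, chain)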

The summarized result is spliced into the original LLMChain where the system prompt is kept unaltered and some number of messages are removed and replaced with the summarized contents of the conversation. The system prompt used for summarizing can be overridden.
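If the default summarizing instructions don't fit, they can presumably be replaced when building the summarizer. A minimal sketch, assuming the option is the `override_system_prompt` key mentioned in the commit list below and that it accepts a plain string:

    summarizer =
      SummarizeConversationChain.new!(%{
        llm: ChatOpenAI.new!(%{model: "gpt-4o-mini", stream: false}),
        # assumed option name; replaces the built-in summarization system prompt
        override_system_prompt: """
        You compress conversations. Keep decisions, open questions, and key
        facts; drop pleasantries and repetition.
        """
      })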

A threshold_count is used to specify the number of messages that must be in the chain before the summarizing process is performed. A keep_count value specifies the number of unsummarized messages to keep on the returned chain, which helps the LLM stay consistent with the most recent exchanges.
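Concretely, with the threshold_count: 30 and keep_count: 2 settings used in the sketches above (the chain variables here are placeholders, and whether the system message counts toward these numbers isn't specified, so treat the figures as illustrative):

    # 12 messages: below the threshold of 30, so the chain comes back unchanged.
    SummarizeConversationChain.summarize(summarizer, chain_with_12_messages)

    # 40 messages: the older messages are collapsed into a summarized entry,
    # while the 2 most recent messages and the original system prompt are kept.
    SummarizeConversationChain.summarize(summarizer, chain_with_40_messages)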

It is safe to run on an LLMChain with only a few messages: if the threshold_count is not reached, the original LLMChain is returned without summarizing.

NOTE: A few other fixes found their way into this PR. These include:

  • Improved error handling
  • The ability to override LangChain.Chains.TextToTitleChain's system prompt (sketched below).
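For the TextToTitleChain change, usage would look roughly like this (a sketch: new!/1 and evaluate are the chain's existing entry points as I understand them, and the prompt text and input text are invented for the example):

    alias LangChain.ChatModels.ChatOpenAI
    alias LangChain.Chains.TextToTitleChain

    title =
      TextToTitleChain.new!(%{
        llm: ChatOpenAI.new!(%{model: "gpt-4o-mini", stream: false}),
        input_text: "Help me plan a week of vegetarian dinners for two people.",
        # new in this PR: swap out the built-in system prompt entirely
        override_system_prompt: "You write short, descriptive conversation titles in Title Case."
      })
      |> TextToTitleChain.evaluate()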

- fixed LangChainError type spec
- LLMChain.run - raise specific exception when being run without messages
- try/rescue errors in LLMChain.run and return an error tuple (fixes spec; see the sketch after the commit list)
- updated docs with examples
- support full `override_system_prompt` for greater customization
* main:
  Azure test for ChatOpenAI usage
  added documentation for ChatOpenAI use on Azure
  Fix specs and examples (#211)
  Fix content-part encoding and decoding for Google API. (#212)
* main:
  added error type support for Azure token rate limit exceeded
- operates on an LLMChain to shorten and summarize the messages
- changes when the keep_count is 0
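On the error-handling commits above: the idea is that LLMChain.run traps failures and reports them as an error tuple instead of raising, with a specific exception reserved for running a chain that has no messages. A sketch of how calling code might branch on that, assuming `chain` is an existing LLMChain and that the error tuple carries the updated chain plus a %LangChainError{} struct:

    require Logger

    alias LangChain.Chains.LLMChain
    alias LangChain.LangChainError

    case LLMChain.run(chain) do
      {:ok, updated_chain} ->
        # success: the assistant's reply is the last message on the chain
        updated_chain.last_message.content

      {:error, _updated_chain, %LangChainError{message: message}} ->
        # failure (API error, rate limit, etc.) surfaced as data, not an exception
        Logger.error("LLM call failed: #{message}")
        {:error, message}
    end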
@brainlid merged commit 94980a3 into main on Dec 15, 2024
1 check passed
@brainlid deleted the me-summarize-conversation branch on December 15, 2024 at 22:39