Improve performance of LLM summarizing condenser #6597
Conversation
# Create a new summary event with the condensed content
summary_event = AgentCondensationObservation(summary_response)
for forgotten_event in forgotten_events:
    prompt += str(forgotten_event) + '\n\n'
This might be an argument for standardizing the `Event` -> `Message` conversion and breaking it out of the agent control-flow.
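For illustration, a standalone conversion along those lines might look something like this; the `Message` shape and the `event_to_message` helper are hypothetical, not the project's actual API:

```python
# Hypothetical sketch of a standalone Event -> Message conversion helper,
# decoupled from the agent control-flow. Names and fields are illustrative.
from dataclasses import dataclass
from typing import Any


@dataclass
class Message:
    role: str
    content: str


def event_to_message(event: Any) -> Message:
    """Convert an event-like object into a chat message.

    A real implementation would dispatch on the concrete event type rather
    than relying on str(); this mirrors the prompt-building loop above.
    """
    role = 'assistant' if getattr(event, 'source', None) == 'agent' else 'user'
    return Message(role=role, content=str(event))
```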
Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>
TESTS: test_format() passed
CHANGES: str(val) replaces f"{val:.16G}"
DEPS: None modified
INTENT: Fix float precision overflow"""
Interesting that Sonnet is doing reasonably well with this prompt. I'm going to test it on Deepseek 😅
(not necessary for this PR, of course, just a thought)
I'm surprised this is what it ended up being, it's definitely not how I would default to prompting!
I started off with a very simple prompt, then passed it and a trajectory or two to Claude and asked what it would change. It took two or three iterations before it met the baseline, but it got me from 0% to 40%. I have to imagine you'd arrive at a very different result if you repeated the process with DeepSeek.
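For reference, a condensation prompt built around the structured fields quoted above (TESTS / CHANGES / DEPS / INTENT) might be sketched like this; the actual prompt in the PR is longer and was iterated on with Claude as described:

```python
# Illustrative sketch only: a summarization prompt template using the
# structured summary fields shown above. The real prompt text differs.
SUMMARY_PROMPT_TEMPLATE = """You are condensing an agent's event history.
Summarize the events below using the following structure:

TESTS: <tests that were run and their results>
CHANGES: <code changes that were made>
DEPS: <dependencies added, removed, or modified>
INTENT: <what the agent is trying to accomplish>

Events:
{events}
"""


def build_summary_prompt(forgotten_events) -> str:
    # Mirror the prompt-building loop from the diff: one block per event.
    rendered = '\n\n'.join(str(event) for event in forgotten_events)
    return SUMMARY_PROMPT_TEMPLATE.format(events=rendered)
```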
mock_llm.completion.assert_called_once()
call_args = mock_llm.completion.call_args[1]
assert 'messages' in call_args
assert len(call_args['messages']) == 1
assert 'Event 1' in call_args['messages'][0]['content']
assert 'Event 2' in call_args['messages'][0]['content']
Maybe we can still test for some message inclusion? Not a big deal, just thinking that tests with condensation are not easy, so maybe we should include whatever we can in the unit tests.
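For illustration, one more assertion in the spirit of the test above could check that every forgotten event's content made it into the prompt; `forgotten_events` here is an assumed fixture, not one from the existing test suite:

```python
# Hypothetical extension of the mock-based test shown above: every forgotten
# event's string form should appear in the single prompt message sent to the
# LLM, matching the prompt-building loop in the diff.
prompt_content = mock_llm.completion.call_args[1]['messages'][0]['content']
for event in forgotten_events:
    assert str(event) in prompt_content
```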
Thanks for this, I absolutely love that we're doing this incrementally, and anyway we can play with that prompt later too.
Co-authored-by: Calvin Smith <calvin@all-hands.dev> Co-authored-by: Engel Nyst <enyst@users.noreply.github.com>
End-user friendly description of the problem this fixes or functionality that this introduces
Give a summary of what the PR does, explaining any non-trivial design decisions
Updates the LLM-based summarizing condenser to improve performance. `LLMSummarizingCondenser` now relies on the `RollingCondenser` to control when summarization happens. `LLMSummarizingCondenser` is now the default condenser (still off by default!), compared to the more aggressive `AmortizedForgettingCondenser`, which has the same forgetting behavior as the `LLMSummarizingCondenser` but without the summarization.

To test performance, I ran this condenser against the no-condensation baseline (both on OH v0.21.0) on the whole of SWE-bench Verified with a maximum of 100 iterations. The condenser resolved 200 instances, compared to the baseline's 203, and cost $40 more to run due to lower prompt cache utilization. However, the average response latency is a consistent 8 seconds, compared to the baseline's 12 seconds (at iteration 30) and 16 seconds (at iteration 100).
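A rough sketch of the control-flow split described above, for readers unfamiliar with the condenser hierarchy; the class names come from this PR, but the method names, signatures, and halving policy are illustrative assumptions, not the actual OpenHands code:

```python
# Illustrative only: the base class decides *when* to condense, subclasses
# decide *how*. The real implementation differs in details.
from abc import ABC, abstractmethod


class RollingCondenser(ABC):
    """Triggers condensation once the history exceeds a size threshold."""

    def __init__(self, max_size: int = 100):
        self.max_size = max_size

    def condensed_history(self, events: list) -> list:
        if len(events) <= self.max_size:
            return events
        return self.condense(events)

    @abstractmethod
    def condense(self, events: list) -> list:
        """Subclasses decide how the history is condensed."""


class AmortizedForgettingCondenser(RollingCondenser):
    """Drops the oldest events outright once the history grows too large."""

    def condense(self, events: list) -> list:
        keep_count = self.max_size // 2
        return events[-keep_count:]


class LLMSummarizingCondenser(RollingCondenser):
    """Same forgetting behavior, but the forgotten prefix is replaced by an
    LLM-generated summary instead of being discarded."""

    def __init__(self, summarize, max_size: int = 100):
        # `summarize` stands in for the actual LLM completion call.
        super().__init__(max_size)
        self.summarize = summarize

    def condense(self, events: list) -> list:
        keep_count = self.max_size // 2
        forgotten, kept = events[:-keep_count], events[-keep_count:]
        summary = self.summarize('\n\n'.join(str(e) for e in forgotten))
        return [summary] + kept
```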
Link of any specific issues this addresses