fix(session): optimize system reminder to reduce token usage #11136

baixiangcpp · 2026-01-29T13:43:56Z

What does this PR do?

This PR fixes a token amplification issue caused by the watchdog reminder logic.

Previously, the session loop would wrap every queued user message in <system-reminder> tags and inline the user’s full text on every iteration when step > 1. As a result, the same user content was repeatedly duplicated in the model input, causing input tokens to grow with both the number of loop iterations and the length of queued messages (effectively O(steps × message_length)). This led to rapid context window exhaustion and unnecessarily high inference costs.

Changes in this PR

Refactored reminder handling
Removed the logic that mutates user message parts. User messages remain intact and are no longer rewritten/expanded during processing.
Optimized prompt injection
Replaced “copy user text into a reminder” with a concise system reminder that references the existence/count of queued messages (e.g., “There are X new user messages waiting…”), without duplicating user content.
Added throttling to prevent reminder spam
Implemented exponential backoff for reminder injection (starting at 15s, doubling up to 5 minutes), so long-running tasks are not repeatedly interrupted by identical reminders.
Added reminder deduplication state
Introduced queuedReminder state to track the latest queued user message ID and avoid reinjecting reminders for the same queued message on every loop cycle.

How did you verify your code works?

Token usage verification
Simulated multi-turn conversations where the user sends multiple messages while the agent is busy. Confirmed input token size remains stable and does not grow explosively over repeated loop iterations.
Behavioral testing
Verified the agent still detects queued user messages and prioritizes responding to them, but now respects the backoff window instead of emitting reminders every iteration.
Regression testing
Exercised normal chat flow, compaction, and tool execution paths to confirm no behavioral regressions and that tasks still complete successfully.
Build check
Ran bun run build locally to ensure the changes compile cleanly with no type errors.

Issue

Fixes #11142

The previous system reminder logic wrapped every queued user message with XML tags and duplicated the user's text, causing significant input token amplification (O(N * Length)). This commit refactors the reminder mechanism to: 1. Stop modifying user messages directly. 2. Inject a concise system prompt only when necessary. 3. Implement exponential backoff (15s to 5min) for reminders to prevent excessive interruption during long tasks. 4. Add state management to deduplicate reminders for the same queued message.

github-actions · 2026-01-29T13:44:08Z

Thanks for your contribution!

This PR doesn't have a linked issue. All PRs must reference an existing issue.

Please:

Open an issue describing the bug/feature (if one doesn't exist)
Add Fixes #<number> or Closes #<number> to this PR description

See CONTRIBUTING.md for details.

github-actions · 2026-01-29T13:44:32Z

The following comment was made by an LLM, it may be inaccurate:

Based on my search results, I found one PR that appears potentially related:

Potentially Related PR:

fix(opencode): resolve performance bottleneck in prompt loop #10190: fix(opencode): resolve performance bottleneck in prompt loop
- fix(opencode): resolve performance bottleneck in prompt loop #10190
- This addresses performance issues in the prompt loop, which could be related to the session reminder optimization in PR fix(session): optimize system reminder to reduce token usage #11136. Both deal with improving efficiency in the session/prompt execution flow.

However, this appears to be addressing a different performance bottleneck rather than the specific token amplification issue from repeated system reminders that PR #11136 fixes.

No other duplicate or directly related PRs were found addressing the specific issue of system reminder token duplication and optimization through deduplication and throttling.

baixiangcpp · 2026-01-29T14:16:13Z

Fixes #11142

github-actions bot added the needs:issue label Jan 29, 2026

baixiangcpp mentioned this pull request Jan 29, 2026

Bug: Watchdog reminder causes excessive token amplification (O(N) growth) #11142

Open

github-actions bot removed the needs:issue label Jan 29, 2026

thdxr force-pushed the dev branch from cbab81f to 2d3c7a0 Compare January 30, 2026 04:49

opencode-agent bot force-pushed the dev branch from 00637c0 to 71e0ba2 Compare January 30, 2026 14:32

thdxr force-pushed the dev branch 4 times, most recently from f1ae801 to 08fa7f7 Compare January 30, 2026 14:37

github-actions bot mentioned this pull request Feb 3, 2026

perf: reduce tool description token usage by ~69% #11993

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(session): optimize system reminder to reduce token usage #11136

fix(session): optimize system reminder to reduce token usage #11136

baixiangcpp commented Jan 29, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Jan 29, 2026

Uh oh!

github-actions bot commented Jan 29, 2026

Uh oh!

baixiangcpp commented Jan 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

fix(session): optimize system reminder to reduce token usage #11136

Are you sure you want to change the base?

fix(session): optimize system reminder to reduce token usage #11136

Conversation

baixiangcpp commented Jan 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Changes in this PR

How did you verify your code works?

Issue

Uh oh!

github-actions bot commented Jan 29, 2026

Uh oh!

github-actions bot commented Jan 29, 2026

Uh oh!

baixiangcpp commented Jan 29, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

baixiangcpp commented Jan 29, 2026 •

edited

Loading