Feature: generic /compact command, auto-compaction, and fork-aware conversations

## Summary

Add a provider-agnostic `/compact` command and auto-compaction support to opencode core, along with fork-aware conversation IDs that play nicely with Codex-style `prompt_cache_key` semantics.

## Background

In `openai/codex` (codex-rs), compaction is a first-class feature:
- `codex-rs/core/src/compact.rs`, `core/src/tasks/compact.rs`, and `docs/slash_commands.md` define:
  - A `/compact` slash command that summarizes conversation history.
  - An `auto_compact_token_limit` based on model metadata (`openai_model_info.rs`).
  - A `SUMMARIZATION_PROMPT` used to create a handoff summary.
- Tests like `core/tests/suite/compact_resume_fork.rs` verify interactions between compaction, resume, and forks, especially with respect to `prompt_cache_key`.

In the Codex OAuth plugin for opencode:
- We currently **strip** OpenCode’s own auto-compaction system prompts in CODEX_MODE because they reference a `summary file` path that doesn’t exist for our stateless ChatGPT backend.
- We rely on `store: false` + `include: ["reasoning.encrypted_content"]` to preserve context, but we do not have a coherent `/compact` story at the host level.

## Problem

Without core support for compaction and fork-aware conversation IDs:
- Each provider has to reinvent compaction semantics (or avoid them entirely), leading to inconsistent behavior across backends.
- Codex-style `prompt_cache_key` reuse across resume/fork flows is hard to get right because opencode doesn’t expose explicit fork events to providers.
- The Codex plugin can emulate compaction internally, but it cannot:
  - Register a shared `/compact` slash command.
  - Coordinate auto-compaction decisions across sessions in a provider-agnostic way.

## Proposed Changes

1. **Add a generic `/compact` command in opencode core**
   - Define a new slash command (e.g. `/compact`) that:
     - Requests a summary of the current conversation from the active provider.
     - Replaces (or augments) conversation history with the returned summary so future turns operate on a compacted context.
   - Design the protocol so that providers receive a clear “compact this conversation” signal, not just a normal user message.

2. **Introduce `autoCompactTokenLimit` in core configuration**
   - Add a per-provider and per-model `autoCompactTokenLimit` field in opencode’s config model (analogous to Codex’s `auto_compact_token_limit`).
   - Track approximate token usage per conversation (input + output), and when a configurable threshold is crossed:
     - Automatically trigger the compact action (e.g. run `/compact` under the hood) before sending the next user prompt.
   - Allow users to override or disable auto-compaction globally and per-project.

3. **Fork-aware conversation IDs and provider hooks**
   - Extend the core session manager so that:
     - Conversations have explicit, stable IDs.
     - Forks (branching from an existing conversation) are modeled as first-class operations with `parentConversationId` and optional `forkId`/`branchId`.
   - Provide a provider hook (e.g. `onConversationFork`) that:
     - Allows providers to adjust their own cache keys (`prompt_cache_key` for Codex) when forks occur.
     - Enables Codex-like semantics where:
       - Overrides do *not* change `prompt_cache_key`.
       - Forks do generate new cache keys derived from the parent.

4. **Provider API surface for compaction**
   - Define a provider capability flag (e.g. `supportsCompaction`) and a method like `compactConversation(context) -> summary`.
   - Implement this in the Codex plugin by:
     - Reusing the official `compact` prompt from Codex CLI or a close approximation.
     - Leveraging encrypted reasoning content to maintain continuity.
   - Other providers (OpenAI Platform, local models, etc.) can opt in with their own prompts.

## Why This Belongs in Core

- Compaction and conversation forking are cross-provider concerns: every backend has context limits and benefits from summarization.
- A generic `/compact` command and auto-compaction framework let plugins focus on the *prompt* and *model details* instead of reimplementing session semantics.
- Codex-specific details (like `prompt_cache_key`) become easier to integrate cleanly when opencode’s session model exposes the right hooks.

## Acceptance Criteria

- There is a new `/compact` command in opencode that providers can implement.
- Configuration supports `autoCompactTokenLimit` (or similar) per provider/model.
- Conversations and forks have explicit IDs exposed to providers.
- Providers can implement fork-aware caching (Codex: `prompt_cache_key`, others: own fields) with clear lifecycle events.
- The Codex OAuth plugin and at least one non-Codex provider are able to adopt the new compaction hooks in a consistent way.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature: generic /compact command, auto-compaction, and fork-aware conversations #4317

Summary

Background

Problem

Proposed Changes

Why This Belongs in Core

Acceptance Criteria

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Feature: generic /compact command, auto-compaction, and fork-aware conversations #4317

Description

Summary

Background

Problem

Proposed Changes

Why This Belongs in Core

Acceptance Criteria

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions