feat(provider): add interleaved thinking support for models #5201
Conversation
- Add interleaved_thinking field to ModelsDev Model schema
- Add interleavedThinking capability to provider Model interface
- Update transform logic to handle interleaved_thinking field mapping
- Add test coverage for interleaved thinking transformation

This enables support for models with interleaved thinking capabilities in the provider system, allowing better integration with models that support this feature.
- Make interleavedThinking field optional to maintain backward compatibility
- Prevent TypeScript errors when models don't have this new capability yet
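The backward-compatibility point can be sketched in a few lines; the type and field names below are assumptions based on the PR description, not opencode's actual types:

```typescript
// Sketch: making the capability optional lets existing Model values that
// predate the field keep type-checking, while consumers default it to false.
interface Model {
  id: string;
  interleavedThinking?: boolean; // optional, so older models may omit it
}

const legacy: Model = { id: "gpt-4o" }; // compiles without the new field
const withIt: Model = { id: "kimi-k2", interleavedThinking: true };

// Consumers treat a missing value as "capability not present".
const enabled = (m: Model) => m.interleavedThinking ?? false;
console.log(enabled(legacy), enabled(withIt));
```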
```typescript
capabilities: {
  temperature: model.temperature ?? existingModel?.capabilities.temperature ?? false,
  reasoning: model.reasoning ?? existingModel?.capabilities.reasoning ?? false,
  interleavedThinking: model.interleaved_thinking ?? existingModel?.capabilities.interleavedThinking ?? false,
```
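The fallback chain in the snippet above can be sketched as a small standalone transform: prefer the incoming models.dev value, then any previously-known capability, then false. Field names follow the diff; the surrounding types are illustrative only:

```typescript
// Sketch of the transform's precedence rules (types assumed for illustration).
interface Capabilities {
  temperature: boolean;
  reasoning: boolean;
  interleavedThinking: boolean;
}

interface ModelsDevModel {
  temperature?: boolean;
  reasoning?: boolean;
  interleaved_thinking?: boolean; // snake_case on the wire, camelCase internally
}

function toCapabilities(model: ModelsDevModel, existing?: Capabilities): Capabilities {
  return {
    temperature: model.temperature ?? existing?.temperature ?? false,
    reasoning: model.reasoning ?? existing?.reasoning ?? false,
    interleavedThinking: model.interleaved_thinking ?? existing?.interleavedThinking ?? false,
  };
}

console.log(toCapabilities({ reasoning: true }));
```

Note that `??` only falls through on `null`/`undefined`, so an explicit `false` from models.dev correctly overrides a previously-cached `true`.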
The casing is not consistent across the change set; stick to the one used in the codebase.
I've updated the naming to `interleavedthinking` (all lowercase) to align with the existing `toolcall` convention in the codebase. Pushed the fix.
Address review feedback: rename interleavedThinking to interleavedthinking to match the codebase convention (e.g., toolcall instead of toolCall)
I think the correct fix is adding reasoning_details support to the OpenAI-compatible provider. We should track the interleaved thinking boolean per model, though, but that should first be done on models.dev. I am going to add interleaved thinking support to our custom AI SDK provider.
I tried using the reasoning_details parameter, but it didn't work for many providers; for example, LiteLLM doesn't work, nor does VertexAI (for Kimi and MiniMax via API). Instead, I tried passing the reasoning via content, and GPT OSS magically became more competent; it was like night and day for simple local tasks. MiniMax and Kimi had the same result: before, their reasoning constantly opened with "The user asked me...", whereas now, for subsequent messages, they respond to the tool.
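The workaround described here can be sketched simply: instead of sending prior reasoning through reasoning_details (which some gateways drop), inline it into the assistant text so the model sees its own chain of thought on the next turn. The part shapes and the `<think>` tag below are assumptions for illustration, not opencode's actual message format:

```typescript
// Sketch: flatten reasoning parts into plain assistant content so providers
// that ignore reasoning_details still receive the prior chain of thought.
type Part = { type: "reasoning" | "text"; text: string };

function inlineReasoning(parts: Part[]): string {
  return parts
    .map((p) => (p.type === "reasoning" ? `<think>${p.text}</think>` : p.text))
    .join("\n");
}

const msg = inlineReasoning([
  { type: "reasoning", text: "The file is missing, so create it." },
  { type: "text", text: "Creating config.json now." },
]);
console.log(msg);
```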
Ah okay, that's a good point. Hm, okay, I'll do some more research and we will talk internally about this problem in a few hrs. I do see why this fix works; it does feel a bit like a hack, but I'm very thankful for you bringing this to my attention. I will keep you posted.
How did you do this? I am seeing exactly what you are reporting: e.g., each reasoning message starts with "The user asks me..." instead of the model continuing where it left off.
Hi @rekram1-node |
There were 2 different interleaved thinking PRs. What format does LiteLLM expect? Can you not define this in your opencode.json? We can add more mapping options, but if all your models are being defined by you, you should be able to specify which data to send back.
It seems that nowadays there is no standard that all providers adhere to for interleaved thinking support, so everyone implements whatever version they like, and others don't implement it at all. That is why, in my opinion, it would be truly useful if OpenCode (and generally any LLM client) offered a certain degree of provider customization. So, in the specific case of models on LiteLLM, it seems you have to pass it using

I saw merged PR #5207; could this be useful in any way for creating provider-specific plugins without messing up the configuration?
We can add/expand the interleaved thinking configuration support, but I don't think we should be converting all reasoning chunks to text parts. If there is a specific provider that requires it, then maybe, but so far all the providers that'd want it that way (that I've seen) already send the reasoning chunks back as assistant messages with the ... tags.
@rekram1-node |
Sweet |
Summary
- Add `interleaved_thinking` field to ModelsDev Model schema to detect models with interleaved thinking capability
- Add `interleavedThinking` capability to provider Model interface for internal representation

What is Interleaved Thinking?
Interleaved thinking is a reasoning approach where large language models alternate between thinking and action/answering steps, rather than following the traditional "think-then-answer" pattern. Instead of generating a long chain of thought followed by a single response, models using interleaved thinking follow a pattern like:
Reason → Tool Call → Observe → Reason → Tool Call → ...
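The pattern above can be sketched as a small agent loop. All names here are hypothetical, chosen for illustration rather than taken from opencode's actual API:

```typescript
// Minimal sketch of an interleaved-thinking loop: the model alternates
// reasoning with tool calls, and each tool result is fed back before the
// next reasoning step, instead of one long think-then-answer pass.
type Step =
  | { kind: "reason"; text: string }
  | { kind: "toolCall"; name: string; args: string }
  | { kind: "observe"; result: string };

type Model = (history: Step[]) => Step | null;

function runInterleaved(
  model: Model,
  tools: Record<string, (args: string) => string>,
  maxSteps = 10,
): Step[] {
  const history: Step[] = [];
  for (let i = 0; i < maxSteps; i++) {
    const step = model(history);
    if (step === null) break; // model decided it is done
    history.push(step);
    if (step.kind === "toolCall") {
      // Feed the tool result back so the next reasoning step can use it.
      history.push({ kind: "observe", result: tools[step.name](step.args) });
    }
  }
  return history;
}

// Toy model: reason, call a tool once, then reason over the observation.
const demoModel: Model = (history) => {
  if (history.length === 0) return { kind: "reason", text: "Need the file list first." };
  if (history.length === 1) return { kind: "toolCall", name: "ls", args: "." };
  if (history.length === 3) return { kind: "reason", text: "Got the listing; answering." };
  return null;
};

const trace = runInterleaved(demoModel, { ls: () => "README.md src/" });
console.log(trace.map((s) => s.kind).join(" → "));
// reason → toolCall → observe → reason
```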
Key Benefits:
- `reasoning_details` structures

Research & Sources
This implementation is based on current research and industry developments:
Technical Changes
- Add `interleaved_thinking` boolean field to detect model capability
- Add `interleavedThinking` boolean to Model capabilities

Applications
This capability transforms traditional function-calling into agent-level tool use, making it particularly valuable for:
Testing
All existing tests pass, and new test coverage has been added for the interleaved thinking transformation logic. The changes maintain full backward compatibility with existing model configurations.