feat(mistral): allow Mistral transforms for local/custom models #5053
Conversation
Can you share your opencode.json? So you are using a Mistral model, but "mistral" isn't in the model name at all? I think it may be cleaner to just rename the model in your config rather than introduce new config variations that complicate things further. Maybe I'm missing something.
Hi, thanks for your kind reply! True, I don't always have "mistral" in the model name, and most of the time that's fixable on the server side, 100%. However, it took me some time to realize the model name has to contain "mistral". So yes, it's easily fixable on the server side once you know there's a mechanism and the conditions under which it applies ;). What motivates my PR is letting the user control the application of that critical correction mechanism; it's currently applied without the user being involved at all. This silent interjection is truly helpful for most users, no doubt, but if you happen to dig a little deeper, like fine-tuning or swapping models from different sources/quants, it can add to the trickiness of the situation. See my funny rant below.

Debugging this kind of situation is tricky, at best. You end up attributing the issues to…

I'm sure you know very well what I'm talking about. There are so many ways these things can break. It's fascinating, but sometimes just a bit of control really helps stabilize things and debug. I'm not happy that my PR changes the config template; it's very visible, whereas you had tried to make this elegant and silent. Do you think another config approach would be better? Somewhere other than in the model provider? At the end of the day, I'll understand if you feel this is too much added complexity for no real benefit: the user still must know about this parameter 🤷‍♂️
For your use case it sounds like it makes more sense as a plugin... I wonder if we should add a hook here:
What do you think?
Brilliant!
Only thing is `normalizeMessages` has them in the AI SDK format, and we'd need the plugin to expose our format.
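To make the idea concrete, here is one hypothetical shape such a hook could take; the hook name `messages.transform`, the `Message` type, and the signature are all my illustration, not opencode's actual plugin API:

```typescript
// Hypothetical sketch only, not opencode's real plugin API: a hook that
// lets a plugin rewrite the outgoing messages, in the project's own
// message format, right before they are handed to the provider.
interface Message {
  role: "system" | "user" | "assistant" | "tool"
  content: string
}

interface Plugin {
  "messages.transform"?: (
    input: { providerID: string; modelID: string },
    output: { messages: Message[] }, // plugin mutates this in place
  ) => Promise<void>
}

// A plugin could then apply Mistral-style fixes for any provider it likes,
// without opencode having to guess from the model name.
```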
Thanks, I'll try to wrap my head around plugins and suggest something. I think I'll close this PR for now. WDYT?
Someone started working on a plugin hook for this, actually: I've been discussing with them on Discord.
Thanks @rekram1-node! I'm happy to close this one. Cheers |
Thanks for this wonderful project!
This PR addresses an issue when running Mistral-family models (codestral, devstral, etc.) through custom providers like local vLLM/sglang/llama.cpp or ollama setups.
The problem I have
Mistral models have specific requirements for the message format they accept.
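One concrete example of such a requirement (my illustration; the PR changes when the transform applies, not what it does): Mistral's API rejects tool-call IDs that are not exactly 9 alphanumeric characters, while OpenAI-style providers emit longer IDs like `call_abc123xyz`. A transform therefore has to rewrite each ID consistently across the assistant tool call and the matching tool result, roughly like this sketch:

```typescript
// Illustrative sketch, not opencode's actual transform code.
// Mistral requires tool-call IDs matching /^[a-zA-Z0-9]{9}$/, so map each
// original ID to a stable 9-character alphanumeric ID.
const idMap = new Map<string, string>()

function toMistralToolCallId(id: string): string {
  let mapped = idMap.get(id)
  if (!mapped) {
    const clean = id.replace(/[^a-zA-Z0-9]/g, "")
    mapped = clean.padEnd(9, "0").slice(0, 9)
    idMap.set(id, mapped) // reuse the same ID for the tool result message
  }
  return mapped
}

// e.g. toMistralToolCallId("call_8f2-k1") === "call8f2k1"
```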
The existing transform logic only kicked in when `providerID === "mistral"` or the model name contained `mistral`. This is smart, but it means custom providers serving codestral/devstral would fail with cryptic API errors, typically (for me) after tool calls.
The fix I suggest
Two changes:

1. Added a `transforms` config option at the provider level, so you can explicitly opt in:

```json
{
  "provider": {
    "my-local-llm": {
      "api": "http://localhost:8080/v1",
      "npm": "@ai-sdk/openai-compatible",
      "transforms": "mistral",
      "models": { ... }
    }
  }
}
```

2. Extended pattern matching to also catch `codestral`, `devstral`, `ministral`, and `pixtral` model names automatically.

The detection now works as: explicit config > providerID match > model name pattern.
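A minimal sketch of that precedence, assuming a string-valued `transforms` option as in the config example above (names like `needsMistralTransform` are illustrative, not the PR's actual code):

```typescript
// Illustrative precedence check: explicit config > providerID > model name.
const MISTRAL_PATTERNS = ["mistral", "codestral", "devstral", "ministral", "pixtral"]

function needsMistralTransform(opts: {
  transforms?: string // provider-level config, e.g. "mistral"
  providerID: string
  modelID: string
}): boolean {
  // 1. Explicit config wins, whether it opts in or out.
  if (opts.transforms !== undefined) return opts.transforms === "mistral"
  // 2. Otherwise, fall back to the provider ID.
  if (opts.providerID === "mistral") return true
  // 3. Finally, fall back to model-name patterns.
  const model = opts.modelID.toLowerCase()
  return MISTRAL_PATTERNS.some((p) => model.includes(p))
}
```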
Also updated the DeepSeek detection to follow the same pattern for consistency (it supports `transforms: "deepseek"` now too).
Testing
Added tests covering the new detection behavior described above.
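For flavor, a hypothetical test along those lines, written against the `needsMistralTransform` sketch above (using `bun:test`; the cases are illustrative, not the PR's actual test file):

```typescript
import { describe, expect, test } from "bun:test"

// Assumes the illustrative needsMistralTransform sketch above is in scope.
describe("mistral transform detection", () => {
  test("explicit provider config wins over the model name", () => {
    expect(
      needsMistralTransform({ transforms: "mistral", providerID: "my-local-llm", modelID: "some-model" }),
    ).toBe(true)
  })

  test("codestral is caught by the model-name pattern", () => {
    expect(needsMistralTransform({ providerID: "my-local-llm", modelID: "codestral-22b-q4" })).toBe(true)
  })

  test("unrelated models are left untouched", () => {
    expect(needsMistralTransform({ providerID: "my-local-llm", modelID: "llama-3.1-8b" })).toBe(false)
  })
})
```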
Happy to adjust the approach if there's a better way to handle this!