Better Qwen3VL chat template. #17

alcoftTAO · 2025-11-05T18:36:43Z

New template for Qwen3-VL!
This new template allows the usage of tools/functions, as well as executing tools/functions after answering the user's prompt/question (an issue previous Qwen models had because of their chat template).

Also added some quality-of-life improvements, such as extra_template_arguments where derivatives of the Llava15ChatHandler class can add/overwrite arguments to the Jinja2 template.

Also added a thinking_budget parameter in the Qwen3VLChatHandler class for future updates, the model right now seems to ignore it.

Removed the use_thinking_prompt parameter because the new template doesn't need them; works with both Qwen3-VL-Instruct and Qwen3-VL-Thinking.

I only tested it with Qwen3-VL-2B (both Thinking and Instruct versions) and seems to work fine.

Previously, the Thinking version of the models didn't generate the <think> XML tag because it was written in the template, so I fixed that. Now the Thinking models generate the <think> tag.

alcoftTAO · 2025-11-05T18:39:30Z

@JamePeng Please let me know if this project supports video inference for multimodal models, since I'd also like to implement it in the template if supported.

JamePeng · 2025-11-06T11:19:04Z

@JamePeng Please let me know if this project supports video inference for multimodal models, since I'd also like to implement it in the template if supported.

You can follow the progress of this implementation; I will adapt it when merging it into the main project: ngxson/llama.cpp#32

Previously, the Thinking version of the models didn't generate the <think> XML tag because it was written in the template, so I fixed that. Now the Thinking models generate the <think> tag.

The chat_template in the Qwen3VL-thinking series contains the tag. It's best to keep it consistent with the official version. Disabling it won't affect usage, but without forced thinking, there's a possibility that some users won't think at all.

See: https://huggingface.co/Qwen/Qwen3-VL-8B-Thinking/blob/main/chat_template.json

alcoftTAO · 2025-11-06T13:45:38Z

The chat_template in the Qwen3VL-thinking series contains the tag. It's best to keep it consistent with the official version. Disabling it won't affect usage, but without forced thinking, there's a possibility that some users won't think at all.

I tested it and had no issues, but I'll add the tags back if you'd like!

alcoftTAO · 2025-11-07T23:22:48Z

Done, I added the <think> tag back.

JamePeng · 2025-11-08T00:45:49Z

I see that thinking_budget hasn't been implemented yet. Should we not pass it as a parameter for now? That's about it.

JamePeng · 2025-11-08T01:44:17Z

llama_cpp/llama_chat_format.py

+                "{%- for content in message.content -%}"
+                    "{%- if 'image_url' in content -%}"
+                        "{%- set image_count.value = image_count.value + 1 -%}"
+                        "{%- if add_vision_id -%}"


It seems there's no way to pass the add_vision_id tag? Without a counter, multi-image recognition can easily lead to misinterpretations.

JamePeng · 2025-11-08T02:44:35Z

LGTM

Better Qwen3VL chat template.

1d41825

Update Submodule vendor/llama.cpp 48bd265..b7f9010

24d72de

alcoftTAO and others added 2 commits November 8, 2025 00:20

Merge branch 'JamePeng:main' into main

a1c764b

Updated chat template for Qwen3-VL to add the <think> tag again.

58ee399

JamePeng force-pushed the main branch from 24d72de to a1c6dc2 Compare November 8, 2025 00:38

Merge branch 'JamePeng:main' into main

48d6507

Deleted 'thinking_budget' because it's not implemented yet.

a749dfa

JamePeng reviewed Nov 8, 2025

View reviewed changes

Added 'add_vision_id' to the chat template.

14d14cc

JamePeng merged commit 17ba24f into JamePeng:main Nov 8, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Better Qwen3VL chat template. #17

Better Qwen3VL chat template. #17

Uh oh!

alcoftTAO commented Nov 5, 2025

Uh oh!

alcoftTAO commented Nov 5, 2025

Uh oh!

JamePeng commented Nov 6, 2025

Uh oh!

alcoftTAO commented Nov 6, 2025

Uh oh!

alcoftTAO commented Nov 7, 2025

Uh oh!

JamePeng commented Nov 8, 2025

Uh oh!

JamePeng Nov 8, 2025

Uh oh!

JamePeng commented Nov 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Better Qwen3VL chat template. #17

Better Qwen3VL chat template. #17

Uh oh!

Conversation

alcoftTAO commented Nov 5, 2025

Uh oh!

alcoftTAO commented Nov 5, 2025

Uh oh!

JamePeng commented Nov 6, 2025

Uh oh!

alcoftTAO commented Nov 6, 2025

Uh oh!

alcoftTAO commented Nov 7, 2025

Uh oh!

JamePeng commented Nov 8, 2025

Uh oh!

JamePeng Nov 8, 2025

Choose a reason for hiding this comment

Uh oh!

JamePeng commented Nov 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants