webui: support q URL parameter #16728

odrling · 2025-10-22T22:54:54Z

Fixes #16722
I’ve checked that it works with Firefox’s AI tools

I’m not sure if I have to rebuild the webui bundle in this PR or let a maintainer do it before merging.
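
For readers unfamiliar with the feature, here is a minimal sketch of how a `q` search parameter could be read from a page URL and turned into an initial prompt. It is an illustration only, not the code merged in this PR: the helper name `readPromptFromUrl` and the example host and port are assumptions, and the actual change lives in `+page.svelte` and `+page.ts`.

```typescript
// Hypothetical helper for illustration; not the implementation from this PR.
// Returns the decoded `q` search parameter, or null when it is absent or blank.
function readPromptFromUrl(href: string): string | null {
  const url = new URL(href);
  // URLSearchParams.get() percent-decodes the value and returns null if the key is missing.
  const q = url.searchParams.get("q");
  if (q === null) return null;
  const trimmed = q.trim();
  return trimmed.length > 0 ? trimmed : null;
}

// Example: an external tool opens the web UI with a pre-filled query.
// The host and port here are placeholders.
const prompt = readPromptFromUrl("http://localhost:8080/?q=Summarize%20this%20page");
console.log(prompt);
```

Treating a blank `q` the same as a missing one avoids starting an empty conversation when a caller passes `?q=` with no text.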

allozaur

This needs proper Svelte 5 syntax as in the code suggestions.

tools/server/webui/src/routes/+page.svelte

tools/server/webui/src/routes/+page.ts

tools/server/webui/src/routes/+page.svelte

Co-authored-by: Aleksander Grygier <aleksander.grygier@gmail.com>

allozaur

Okay, I've just tested this locally and it's working great. The last thing to do is to include the static build; after that we just need CI to pass and we'll be good to go ;)

odrling · 2025-10-23T15:20:26Z

OK, I’ve updated the static build.

* model-conversion : add trust_remote_code for orig model run [no ci] (ggml-org#16751)

  This commit add the trust_remote_code=True argument when loading models using AutoConfig, AutoTokenizer, and AutoModelForCausalLM for the run original model script.

  The motivation for this is that some models require custom code to be loaded properly, and setting trust_remote_code=True avoids a prompt asking for user confirmation:

  ```console
  (venv) $ make causal-run-original-model
  The repository /path/to/model contains custom code which must be executed to correctly load the model.
  You can inspect the repository content at /path/to/model.
  Do you wish to run the custom code? [y/N] N
  ```

  Having this as the default seems like a safe choice as we have to clone or download the models we convert and would be expecting to run any custom code they have.

* webui: support q URL parameter (ggml-org#16728)
  * webui: support q URL parameter

    Fixes ggml-org#16722 I’ve checked that it works with Firefox’s AI tools
  * webui: apply suggestions from code review

    Co-authored-by: Aleksander Grygier <aleksander.grygier@gmail.com>
  * chore: update webui static build

  ---------

  Co-authored-by: Aleksander Grygier <aleksander.grygier@gmail.com>

* CUDA: use CUB for arbitary size argsort (ggml-org#16754)
* ggml: fix CUDA grid launch condition for large block_nums.y in binbcast (ggml-org#16742)
  * Fix CUDA grid launch condition for large block_nums.y
  * add backend ops test
  * reduce test repetitions
* convert : avoid dequantizing mxfp4 for GPT-OSS (ggml-org#16756)
* vulkan: Optimize SSM_SCAN (ggml-org#16645)
* vulkan: delete dead code (ggml-org#16732)

  ggml_vk_create_buffer_temp is not used anywhere, and it is the only caller for ggml_vk_pool_malloc.

  Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>
* model : set res->t_embd in PLaMo2 models (ggml-org#16766)

---------

Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>
Co-authored-by: Daniel Bevenius <daniel.bevenius@gmail.com>
Co-authored-by: Florian Badie <florianbadie@odrling.xyz>
Co-authored-by: Aleksander Grygier <aleksander.grygier@gmail.com>
Co-authored-by: Aman Gupta <amangupta052@gmail.com>
Co-authored-by: leejet <leejet714@gmail.com>
Co-authored-by: compilade <git@compilade.net>
Co-authored-by: Jeff Bolz <jbolz@nvidia.com>
Co-authored-by: Giuseppe Scrivano <gscrivan@redhat.com>
Co-authored-by: Shunta Saito <shunta.saito@gmail.com>

* qwen3-coder tool call parser
* reset template
* Fix grammar, hide tool_call from output
* Fix C++ compilation error in tests/test-chat.cpp

  Add missing closing brace to terminate test_template_output_parsers() function. This resolves compilation errors that prevented successful build of the test-chat target.
* Update common/chat.cpp

  Co-authored-by: Kashyap Jois <kjois@iprdgroup.com>
* Update common/chat.cpp

  Co-authored-by: Kashyap Jois <kjois@iprdgroup.com>
* Fix for test
* revert
* Update common/chat.cpp

  Co-authored-by: Marcel de Vries <marceldev89@gmail.com>
* Update common/chat.cpp

  Co-authored-by: Marcel de Vries <marceldev89@gmail.com>
* removed test
* Qwen3-Coder XML: handle union schema types and sanitize unsupported branches; add tests
  - chat-parser: support schema.type as array (e.g. ["number","null"]) in convert_qwen3_param_value()
  - chat: resolve $refs; allow unions including "string" as freeform; sanitize empty {"not":{}} in anyOf/oneOf before add_schema
  - tests: add Qwen3-Coder regression ensuring grammar builds with unions and ignores {"not":{}}
* Moved common_chat_parse_qwen3_coder_xml
* Fix merge oopsie
* Sync bundled template with upstream

  See https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct/blob/main/chat_template.jinja
* Fix crash when tool call doesn't start with <tool_call>
* model-conversion : add trust_remote_code for orig model run [no ci] (ggml-org#16751)

  This commit add the trust_remote_code=True argument when loading models using AutoConfig, AutoTokenizer, and AutoModelForCausalLM for the run original model script.

  The motivation for this is that some models require custom code to be loaded properly, and setting trust_remote_code=True avoids a prompt asking for user confirmation:

  ```console
  (venv) $ make causal-run-original-model
  The repository /path/to/model contains custom code which must be executed to correctly load the model.
  You can inspect the repository content at /path/to/model.
  Do you wish to run the custom code? [y/N] N
  ```

  Having this as the default seems like a safe choice as we have to clone or download the models we convert and would be expecting to run any custom code they have.
* webui: support q URL parameter (ggml-org#16728)
  * webui: support q URL parameter

    Fixes ggml-org#16722 I’ve checked that it works with Firefox’s AI tools
  * webui: apply suggestions from code review

    Co-authored-by: Aleksander Grygier <aleksander.grygier@gmail.com>
  * chore: update webui static build

  ---------

  Co-authored-by: Aleksander Grygier <aleksander.grygier@gmail.com>

---------

Co-authored-by: Benjamin Oldenburg <benjamin.oldenburg@ordis.co.th>
Co-authored-by: Marcel de Vries <marceldev89@gmail.com>
Co-authored-by: Kashyap Jois <kjois@iprdgroup.com>
Co-authored-by: Daniel Bevenius <daniel.bevenius@gmail.com>
Co-authored-by: Florian Badie <florianbadie@odrling.xyz>
Co-authored-by: Aleksander Grygier <aleksander.grygier@gmail.com>

webui: support q URL parameter

e28e4f6

Fixes ggml-org#16722 I’ve checked that it works with Firefox’s AI tools

odrling requested a review from allozaur as a code owner October 22, 2025 22:54

github-actions bot added the examples and server labels Oct 22, 2025

allozaur requested changes Oct 22, 2025

tools/server/webui/src/routes/+page.svelte (outdated, resolved)

tools/server/webui/src/routes/+page.ts (outdated, resolved)

tools/server/webui/src/routes/+page.svelte (outdated, resolved)

webui: apply suggestions from code review

6fe4281

Co-authored-by: Aleksander Grygier <aleksander.grygier@gmail.com>

odrling requested a review from allozaur October 23, 2025 01:55

allozaur requested changes Oct 23, 2025

chore: update webui static build

63996b3

odrling requested a review from allozaur October 23, 2025 15:20

allozaur approved these changes Oct 24, 2025

allozaur merged commit 69e9ff0 into ggml-org:master Oct 24, 2025
14 checks passed

chansikpark mentioned this pull request Oct 28, 2025

Misc. bug: llama-server integration with Firefox's AI chatbot feature breaks with overly long queries #16830

Closed
