mtmd: add --image-min/max-tokens #16921

ngxson · 2025-11-01T18:50:15Z

Changes in mtmd API:

Adds image_min_tokens and image_max_tokens to mtmd_context_params

Changes in llama-mtmd-cli and llama-server:

Adds --image-min-tokens N and --image-max-tokens N arguments
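
For readers unfamiliar with what the two bounds do, here is a minimal sketch of how a min/max pair could constrain the number of tokens an image is encoded into. It is not the code from this PR: `image_token_limits` and `clamp_image_tokens` are hypothetical names, only the field names `image_min_tokens` and `image_max_tokens` come from the description above, and treating a negative value as "keep the model default" is an assumption.

```cpp
#include <algorithm>
#include <cassert>

// Hypothetical stand-in for the two fields this PR adds to mtmd_context_params.
// Assumption: a negative value means "no override, use the model's own default".
struct image_token_limits {
    int image_min_tokens = -1;
    int image_max_tokens = -1;
};

// Hypothetical helper: resolve the effective bounds against the model defaults,
// then clamp the requested token count into [lo, hi].
static int clamp_image_tokens(int n_tokens,
                              const image_token_limits & user,
                              int model_min,
                              int model_max) {
    assert(model_min <= model_max);

    int lo = user.image_min_tokens >= 0 ? user.image_min_tokens : model_min;
    int hi = user.image_max_tokens >= 0 ? user.image_max_tokens : model_max;

    // Keep the range well-formed if the user passes min > max.
    if (hi < lo) {
        hi = lo;
    }

    return std::min(std::max(n_tokens, lo), hi);
}
```

With both fields left at their defaults the model's own range applies. Setting them (for example through `--image-min-tokens 256 --image-max-tokens 2048`) would widen or narrow that range under the assumptions stated above.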

* origin/master: (169 commits)
  opencl: support imrope (ggml-org#16914)
  fix: Viewing multiple PDF attachments (ggml-org#16974)
  model-conversion : pass config to from_pretrained (ggml-org#16963)
  server : add props.model_alias (ggml-org#16943)
  ggml: CUDA: add head size 72 for flash-attn (ggml-org#16962)
  mtmd: add --image-min/max-tokens (ggml-org#16921)
  mtmd: pad mask for qwen2.5vl (ggml-org#16954)
  ggml : LoongArch fixes (ggml-org#16958)
  sync: minja (glm 4.6 & minmax m2 templates) (ggml-org#16949)
  SYCL: optimized repeat_back kernel (3× fewer asm instructions, 2× faster)Feature/sycl repeat back opt (ggml-org#16869)
  feat(webui): improve LaTeX rendering with currency detection (ggml-org#16508)
  test-backend-ops : fix segfault in moe-expert-reduce test in support mode and coverage (ggml-org#16936)
  ci : disable failing riscv cross build (ggml-org#16952)
  model: add Janus Pro for image understanding (ggml-org#16906)
  clip : use FA (ggml-org#16837)
  server : support unified cache across slots (ggml-org#16736)
  common : move gpt-oss reasoning processing to init params (ggml-org#16937)
  docs: remove llama_sampler_accept reference in sampling sample usage (ggml-org#16920)
  CUDA: add FLOOR, CEIL, ROUND, TRUNC unary ops (ggml-org#16917)
  devops: fix failing s390x docker build (ggml-org#16918)
  ...

mtmd: add --image-min/max-tokens

70cc330

ngxson requested a review from ggerganov as a code owner November 1, 2025 18:50

github-actions bot added examples server labels Nov 1, 2025

Merge branch 'master' into xsn/mtmd_custom_min_max_tokens

79b98db

ggerganov approved these changes Nov 3, 2025

ngxson merged commit 070ff4d into master Nov 3, 2025
69 of 73 checks passed

GittyBurstein pushed a commit to yael-works/llama.cpp that referenced this pull request Nov 5, 2025

mtmd: add --image-min/max-tokens (ggml-org#16921)

f92a084

wqerrewetw mentioned this pull request Nov 5, 2025

Eval bug: Qwen3-VL-8B freezes on image processing tasks #17012

Open
