[Misc] Simplify max tokens in multimodal registry #27500

DarkLight1337 · 2025-10-25T02:17:46Z

Purpose

get_max_tokens_per_item_by_nonzero_modality is redundant because get_max_tokens_per_item_by_modality already considers the limits by only passing in modalities with limit > 0.
Don't create processor twice in get_max_tokens_per_item_by_modality.

Test Plan

Test Result

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

gemini-code-assist

Code Review

This pull request correctly simplifies the logic for determining the maximum tokens for multimodal models by removing the redundant get_max_tokens_per_item_by_nonzero_modality function and avoiding the double creation of the multimodal processor within get_max_tokens_per_item_by_modality. The changes are clean and improve efficiency. However, I've identified a similar inefficiency in MultiModalBudget.__init__ where the processor is still created twice. Addressing this would further align with the goals of this PR.

vllm/v1/worker/utils.py

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

ywang96 · 2025-10-25T04:09:47Z

Looks like the one of the tests didn't pass cc @DarkLight1337

ValueError: At most 0 image(s) may be provided in one prompt.

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 · 2025-10-25T04:15:18Z

Fixed in 10532e5

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: 0xrushi <6279035+0xrushi@users.noreply.github.com>

[Misc] Simplfy max tokens in multimodal registry

5cad734

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 requested a review from Isotr0py October 25, 2025 02:17

DarkLight1337 requested review from WoosukKwon, njhill, robertgshaw2-redhat and ywang96 as code owners October 25, 2025 02:17

DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 25, 2025

DarkLight1337 requested review from alexm-redhat, comaniac and heheda12345 as code owners October 25, 2025 02:17

DarkLight1337 added the multi-modality Related to multi-modality (#4194) label Oct 25, 2025

DarkLight1337 requested review from ApostaC and NickLucche as code owners October 25, 2025 02:17

mergify bot added the v1 label Oct 25, 2025

gemini-code-assist bot reviewed Oct 25, 2025

View reviewed changes

vllm/v1/worker/utils.py Show resolved Hide resolved

Optimize

3320334

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 changed the title ~~[Misc] Simplfy max tokens in multimodal registry~~ [Misc] Simplify max tokens in multimodal registry Oct 25, 2025

Isotr0py approved these changes Oct 25, 2025

View reviewed changes

Isotr0py enabled auto-merge (squash) October 25, 2025 03:36

Filter modalities

10532e5

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

ywang96 approved these changes Oct 25, 2025

View reviewed changes

vllm-bot merged commit 4c5f632 into vllm-project:main Oct 25, 2025
45 of 47 checks passed

DarkLight1337 deleted the simplify-max-tokens branch October 25, 2025 06:56

0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request Oct 26, 2025

[Misc] Simplify max tokens in multimodal registry (vllm-project#27500)

3893339

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: 0xrushi <6279035+0xrushi@users.noreply.github.com>

0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request Oct 26, 2025

[Misc] Simplify max tokens in multimodal registry (vllm-project#27500)

441408f

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: 0xrushi <6279035+0xrushi@users.noreply.github.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Misc] Simplify max tokens in multimodal registry #27500

[Misc] Simplify max tokens in multimodal registry #27500

Uh oh!

DarkLight1337 commented Oct 25, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

ywang96 commented Oct 25, 2025

Uh oh!

DarkLight1337 commented Oct 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

[Misc] Simplify max tokens in multimodal registry #27500

[Misc] Simplify max tokens in multimodal registry #27500

Uh oh!

Conversation

DarkLight1337 commented Oct 25, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

ywang96 commented Oct 25, 2025

Uh oh!

DarkLight1337 commented Oct 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

DarkLight1337 commented Oct 25, 2025 •

edited by github-actions bot

Loading