[Core] Simplify mm processing cache #22457

DarkLight1337 · 2025-08-07T13:11:31Z

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing the test command.
The test results, such as pasting a before/after results comparison, or e2e results.
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.

Purpose

Refactor the multimodal processing and serialization code in preparation for moving the processing cache from P0 to P1.

Key changes:

Move construction of MultiModalKwargs out of the inner _apply_hf_processor_text_mm and into the outer _apply_hf_processor and _cached_apply_hf_processor.
Remove unnecessary hashing inside the ProcessingCache class.
Remove the unnecessary wrapper classes ProcessingCacheOptionalItem and ProcessingCacheItem.
Split the serialization logic into individual methods for readability.
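
To illustrate the second and third changes, here is a minimal sketch of a cache that performs no hashing itself and stores processed values directly, with no wrapper objects. It is not vLLM's actual code: `SimpleProcessingCache`, `process_with_cache`, and the `(item_hash, raw_item)` input shape are hypothetical names chosen for this example, and it assumes the caller has already computed a hash for each item.

```python
from collections import OrderedDict
from typing import Callable, Generic, Iterable, Optional, TypeVar

K = TypeVar("K")  # raw (unprocessed) item type
V = TypeVar("V")  # processed item type


class SimpleProcessingCache(Generic[V]):
    """Hypothetical LRU cache keyed by a hash the caller already computed.

    The cache never hashes anything and stores values as-is,
    so no wrapper classes are needed around the cached items.
    """

    def __init__(self, capacity: int) -> None:
        if capacity <= 0:
            raise ValueError("capacity must be positive")
        self._capacity = capacity
        self._items: "OrderedDict[str, V]" = OrderedDict()

    def get(self, item_hash: str) -> Optional[V]:
        value = self._items.get(item_hash)
        if value is not None:
            # Mark the entry as most recently used.
            self._items.move_to_end(item_hash)
        return value

    def put(self, item_hash: str, value: V) -> None:
        self._items[item_hash] = value
        self._items.move_to_end(item_hash)
        # Evict least recently used entries beyond capacity.
        while len(self._items) > self._capacity:
            self._items.popitem(last=False)


def process_with_cache(
    cache: SimpleProcessingCache[V],
    hashed_items: Iterable[tuple[str, K]],
    process: Callable[[K], V],
) -> list[V]:
    """Return processed outputs, calling `process` only on cache misses."""
    outputs: list[V] = []
    for item_hash, raw in hashed_items:
        out = cache.get(item_hash)
        if out is None:
            out = process(raw)
            cache.put(item_hash, out)
        outputs.append(out)
    return outputs
```

Because the hash arrives with each item, the same key could in principle be reused by another cache elsewhere without rehashing the data. Whether that is how the planned P0-to-P1 move works is not stated in this PR.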

Test Plan

Test Result

(Optional) Documentation Update

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

github-actions · 2025-08-07T13:11:41Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, which covers a small, essential subset of CI tests to catch errors quickly. You can run other CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

gemini-code-assist

Code Review

This pull request refactors the multimodal processing cache to simplify its logic. The key changes include moving the MultiModalKwargs creation to a higher level, removing hashing logic from within the ProcessingCache class, and eliminating unnecessary wrapper classes. The changes are well-structured and achieve the goal of simplification. I've found one issue with a type hint that should be addressed.

vllm/v1/serial_utils.py

Isotr0py

LGTM, thanks for simplifying!

[Core] Simplify mm processing cache

6dec906

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

DarkLight1337 requested a review from Isotr0py August 7, 2025 13:11

DarkLight1337 added the ready label (ONLY add when PR is ready to merge/full CI is needed) Aug 7, 2025

DarkLight1337 requested review from WoosukKwon, alexm-redhat, comaniac, njhill, robertgshaw2-redhat, sighingnow and ywang96 as code owners August 7, 2025 13:11

mergify bot added the multi-modality (Related to multi-modality (#4194)), qwen (Related to Qwen models), and v1 labels Aug 7, 2025

gemini-code-assist bot reviewed Aug 7, 2025

vllm/v1/serial_utils.py (outdated, resolved)

DarkLight1337 added 3 commits August 7, 2025 13:20

Keep mypy happy

9963550

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Fix

5f9502f

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Clean

039f747

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Isotr0py approved these changes Aug 7, 2025

Isotr0py enabled auto-merge (squash) August 7, 2025 13:34

DarkLight1337 added 2 commits August 7, 2025 13:37

Fix test

758ec88

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

Make mypy happy

58d4fac

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

vllm-bot merged commit 8c9da6b into vllm-project:main Aug 7, 2025
35 of 44 checks passed

DarkLight1337 deleted the simply-mm-processing-cache branch August 7, 2025 16:47

jinzhen-lin pushed a commit to jinzhen-lin/vllm that referenced this pull request Aug 9, 2025

[Core] Simplify mm processing cache (vllm-project#22457)

b84e781

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>

DarkLight1337 mentioned this pull request Aug 9, 2025

[CI Failure]: Distributed Tests (2 GPUs) - Mllama TP=2 results divergence and deadlock issue #22559

Closed

3 tasks

noamgat pushed a commit to noamgat/vllm that referenced this pull request Aug 9, 2025

[Core] Simplify mm processing cache (vllm-project#22457)

0ca26b4

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Noam Gat <noamgat@gmail.com>

DarkLight1337 mentioned this pull request Aug 9, 2025

[Core] Use individual MM items in P0/P1 cache and model runner #22570

Merged

4 tasks

paulpak58 pushed a commit to paulpak58/vllm that referenced this pull request Aug 13, 2025

[Core] Simplify mm processing cache (vllm-project#22457)

be7b748

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Paul Pak <paulpak58@gmail.com>

diegocastanibm pushed a commit to diegocastanibm/vllm that referenced this pull request Aug 15, 2025

[Core] Simplify mm processing cache (vllm-project#22457)

520a531

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Diego-Castan <diego.castan@ibm.com>

yiliu30 pushed a commit to yiliu30/vllm-fork that referenced this pull request Aug 19, 2025

[Core] Simplify mm processing cache (vllm-project#22457)

9c103db

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

epwalsh pushed a commit to epwalsh/vllm that referenced this pull request Aug 28, 2025

[Core] Simplify mm processing cache (vllm-project#22457)

6412078

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>

xiao-llm pushed a commit to xiao-llm/vllm that referenced this pull request Aug 28, 2025

[Core] Simplify mm processing cache (vllm-project#22457)

c28de7f

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: Xiao Yu <xiao.yu@amd.com>

zhewenl pushed a commit to zhewenl/vllm that referenced this pull request Aug 28, 2025

[Core] Simplify mm processing cache (vllm-project#22457)

8342f59

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
