Conversation

@DarkLight1337 (Member) commented Mar 22, 2025:

Clean up the code before trying to support the model on V1

cc @HwwwwwwwH

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
@DarkLight1337 DarkLight1337 requested a review from Isotr0py March 22, 2025 16:44
@DarkLight1337 DarkLight1337 requested a review from ywang96 as a code owner March 22, 2025 16:44
@github-actions (bot) commented:

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, only the fastcheck CI runs, covering a small, essential subset of tests to catch errors quickly. You can run additional CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run full CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

@mergify mergify bot added the multi-modality Related to multi-modality (#4194) label Mar 22, 2025
if msg is None:
assert a == b
else:
assert a == b, msg
@DarkLight1337 (Member, Author) Mar 22, 2025:

These changes let pytest show the non-matching items in more detail.
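A minimal sketch of the pattern in the diff above (the helper name `assert_equal` is illustrative, not necessarily vLLM's actual helper): when `msg` is `None`, pytest's assertion rewriting can render a detailed element-wise diff of the mismatching values, whereas an explicit message replaces that diff.

```python
def assert_equal(a, b, msg=None):
    """Assert a == b; omit msg to keep pytest's detailed diff output."""
    if msg is None:
        assert a == b          # under pytest, mismatching items are shown in detail
    else:
        assert a == b, msg     # the custom message replaces the default diff
```

For example, calling `assert_equal([1, 2], [1, 3])` inside a pytest-collected test prints which element differs, while passing a `msg` shows only that message.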

Comment on lines -298 to -299
assert isinstance(images, list)

@DarkLight1337 (Member, Author):

Unnecessary check since the data is parsed in the next line

"TIGER-Lab/Mantis-8B-siglip-llama3",
"mistralai/Pixtral-12B-2409",
"mistral-community/pixtral-12b",
"openbmb/MiniCPM-Llama3-V-2_5",
@DarkLight1337 (Member, Author) Mar 22, 2025:

Need this to test the `else` branch in `_base_call_hf_processor`.

@DarkLight1337 DarkLight1337 marked this pull request as draft March 22, 2025 16:50
@DarkLight1337 (Member, Author) commented:

Marking as draft until I can get the relevant tests to pass locally

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
@DarkLight1337 DarkLight1337 marked this pull request as ready for review March 24, 2025 04:00
@DarkLight1337 (Member, Author) commented:

Ready to review now.

@Isotr0py (Member) commented:

@DarkLight1337 (Member, Author) commented:

This simplified approach is not nearly as memory-efficient; let's see how to optimize it while still keeping the code readable...

Old implementation:

INFO 03-24 08:10:12 [worker.py:267] model weights take 15.19GiB; non_torch_memory takes 0.06GiB; PyTorch activation peak memory takes 3.99GiB; the rest of the memory reserved for KV Cache is 0.55GiB.

New implementation:

INFO 03-24 07:52:04 [worker.py:270] model weights take 15.19GiB; non_torch_memory takes 0.07GiB; PyTorch activation peak memory takes 4.95GiB; the rest of the memory reserved for KV Cache is -0.42GiB.

@DarkLight1337 (Member, Author) commented Mar 24, 2025:

Hmm, this is odd: the memory usage remains exactly the same before `get_embedding_with_vision` is called in the model. I checked using `torch.cuda.memory_allocated()`, `torch.cuda.memory_reserved()`, and `torch.cuda.max_memory_reserved()`.

Old implementation:

# In GiB
allocated0 15.366820335388184
reserved0 15.44140625
max0 15.44140625
allocated1 15.500089168548584
reserved1 20.603515625
max1 20.603515625

New implementation:

# In GiB
allocated0 15.366820335388184
reserved0 15.44140625
max0 15.44140625
allocated1 15.508512020111084
reserved1 20.537109375
max1 21.255859375
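For reference, the GiB figures in these dumps are raw byte counts divided by 2**30; a minimal sketch of the conversion (the helper name is illustrative, and the torch calls are shown only as comments since they require a CUDA device):

```python
GIB = 1024 ** 3  # the dumps above are in GiB (2**30 bytes)

def to_gib(num_bytes: int) -> float:
    """Convert a raw byte count (e.g. the return value of
    torch.cuda.memory_allocated()) to the GiB figures quoted above."""
    return num_bytes / GIB

# Illustrative only -- on a CUDA machine the three figures would come from:
#   allocated = to_gib(torch.cuda.memory_allocated())
#   reserved  = to_gib(torch.cuda.memory_reserved())
#   max_resv  = to_gib(torch.cuda.max_memory_reserved())
```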

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Comment on lines -1024 to -1031
num_slices = mm_data[modality][f"{modality}_num_slices"][b][pos]
slice_start_idx = mm_slice_counts[modality]
slice_end_idx = slice_start_idx + num_slices
pixel_values_flat += mm_data[modality]["pixel_values"][b][slice_start_idx:slice_end_idx]
tgt_sizes_flat += mm_data[modality]["tgt_sizes"][b][slice_start_idx:slice_end_idx]
@DarkLight1337 (Member, Author) Mar 24, 2025:

Upon further inspection, the previous model implementation actually fails to extract all of the slices: the last `slice_end_idx` does not match the length of `mm_data[modality]["pixel_values"][b]`. So the new implementation is more correct, but it also uses more memory.
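A minimal, self-contained reconstruction of the difference (list names are illustrative, not vLLM's actual data structures): the old flattening takes only the first `num_slices[b]` slices of each batch item, so an undercounted `num_slices` silently drops trailing slices, while the new flattening keeps every slice at the cost of holding more tensors alive at once.

```python
def flatten_counted(pixel_values, num_slices):
    """Old-style flattening: take only the first num_slices[b] slices
    of each batch item; trailing slices are silently dropped if
    num_slices undercounts."""
    flat = []
    for b, n in enumerate(num_slices):
        flat += pixel_values[b][:n]
    return flat

def flatten_all(pixel_values):
    """New-style flattening: keep every slice of every batch item."""
    return [s for item in pixel_values for s in item]

# With an undercounted num_slices, the last slice of item 0 is lost:
slices = [["s0", "s1", "s2"], ["s3"]]
assert flatten_counted(slices, [2, 1]) == ["s0", "s1", "s3"]
assert flatten_all(slices) == ["s0", "s1", "s2", "s3"]
```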

@DarkLight1337 (Member, Author):

I'll just disable the multi-image tests in CI.

Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
@DarkLight1337 DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Mar 24, 2025
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
@mergify mergify bot added the documentation Improvements or additions to documentation label Mar 25, 2025
@DarkLight1337 DarkLight1337 enabled auto-merge (squash) March 25, 2025 09:32
@DarkLight1337 DarkLight1337 merged commit a9e879b into vllm-project:main Mar 25, 2025
39 checks passed
@DarkLight1337 DarkLight1337 deleted the minicpm-cleanup branch March 25, 2025 10:26
erictang000 pushed a commit to erictang000/vllm that referenced this pull request Mar 25, 2025
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
wrmedford pushed a commit to wrmedford/vllm that referenced this pull request Mar 26, 2025
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: Wes Medford <wryanmedford@gmail.com>
lulmer pushed a commit to lulmer/vllm that referenced this pull request Apr 7, 2025
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: Louis Ulmer <ulmerlouis@gmail.com>
lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Apr 29, 2025
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
shreyankg pushed a commit to shreyankg/vllm that referenced this pull request May 3, 2025
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>