Migrate LlavaImageInputs to TensorSchema #21770

bbeckca · 2025-07-28T15:33:42Z

Purpose

This PR migrates LlavaImageInputs from a TypedDict-based definition to a structured TensorSchema model with runtime shape validation. This brings it in line with recent changes to Phi3VImagePixelInputs, and is part of a broader effort to improve input contract enforcement and debug-ability across multi-modal models.
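As a rough illustration of what "runtime shape validation" means here, the sketch below checks a shape against a spec that mixes fixed sizes and symbolic names. It is a torch-free stand-in, not vLLM's TensorSchema implementation: `check_shape`, `validate_pixel_values`, and the `("bn", 3, "h", "w")` spec are hypothetical names and values chosen for this example, and the pre-bound `h`/`w` only mimic the `resolve_bindings` idea discussed in the review comments.

```python
from typing import Dict, Optional, Sequence, Union

Dim = Union[int, str]  # int = fixed size, str = symbolic name such as "h"


def check_shape(field: str, actual: Sequence[int], expected: Sequence[Dim],
                bindings: Dict[str, int]) -> None:
    """Validate one shape, recording symbolic sizes in `bindings` as it goes."""
    if len(actual) != len(expected):
        raise ValueError(
            f"{field}: expected {len(expected)} dims, got shape {tuple(actual)}")
    for size, dim in zip(actual, expected):
        if isinstance(dim, int):
            # Constant dimension: must match exactly.
            if size != dim:
                raise ValueError(f"{field}: expected size {dim}, got {size}")
        else:
            # Symbolic dimension: the first use binds it, later uses must agree.
            bound = bindings.setdefault(dim, size)
            if bound != size:
                raise ValueError(
                    f"{field}: dim '{dim}' is {size}, expected {bound}")


def validate_pixel_values(shape: Sequence[int],
                          image_size: Optional[int] = None) -> Dict[str, int]:
    """Hypothetical check for a (bn, 3, h, w) pixel tensor shape."""
    bindings: Dict[str, int] = {}
    if image_size is not None:
        # Pre-binding h and w turns the symbolic dims into hard constraints.
        bindings.update(h=image_size, w=image_size)
    check_shape("pixel_values", shape, ("bn", 3, "h", "w"), bindings)
    return bindings
```

With `image_size=336`, a shape of `(2, 3, 336, 336)` passes and `(2, 3, 224, 224)` raises `ValueError`; without `image_size`, any height and width are accepted and simply recorded.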

Test Plan

Confirm validation works via standalone tests in tests/standalone_tests/test_tensor_schema.py and rely on CI to check integration.

Test Result

(venv) benjibeck@Benjis-MBP vllm % python3 -m pytest tests/standalone_tests/test_tensor_schema.py -v --log-cli-level=DEBUG
============================= test session starts ==============================
platform darwin -- Python 3.9.6, pytest-8.4.1, pluggy-1.6.0 -- /Users/benjibeck/Projects/vllm/venv/bin/python3
cachedir: .pytest_cache
rootdir: /Users/benjibeck/Projects/vllm
configfile: pyproject.toml
plugins: anyio-4.9.0
collected 14 items

tests/standalone_tests/test_tensor_schema.py::test_tensor_schema_valid_tensor PASSED [  7%]
tests/standalone_tests/test_tensor_schema.py::test_tensor_schema_optional_fields PASSED [ 14%]
tests/standalone_tests/test_tensor_schema.py::test_tensor_schema_constant_dim_failure PASSED [ 21%]
tests/standalone_tests/test_tensor_schema.py::test_tensor_schema_symbolic_dim_mismatch PASSED [ 28%]
tests/standalone_tests/test_tensor_schema.py::test_tensor_schema_list_tensor_valid PASSED [ 35%]
tests/standalone_tests/test_tensor_schema.py::test_tensor_schema_variable_patch_counts_valid PASSED [ 42%]
tests/standalone_tests/test_tensor_schema.py::test_tensor_schema_tuple_tensor_valid PASSED [ 50%]
tests/standalone_tests/test_tensor_schema.py::test_tensor_schema_inconsistent_shapes_in_list PASSED [ 57%]
tests/standalone_tests/test_tensor_schema.py::test_tensor_schema_empty_list PASSED [ 64%]
tests/standalone_tests/test_tensor_schema.py::test_tensor_schema_validation_disabled_skips_shape_check PASSED [ 71%]
tests/standalone_tests/test_tensor_schema.py::test_tensor_schema_with_valid_resolve_binding_dims PASSED [ 78%]
tests/standalone_tests/test_tensor_schema.py::test_tensor_schema_with_invalid_resolve_binding_dims PASSED [ 85%]
tests/standalone_tests/test_tensor_schema.py::test_tensor_schema_with_list_of_symbolic_dim PASSED [ 92%]
tests/standalone_tests/test_tensor_schema.py::test_tensor_schema_with_list_of_symbolic_dim_mismatch_in_length PASSED [100%]

github-actions · 2025-07-28T15:33:51Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

gemini-code-assist

Code Review

This PR migrates LlavaImageInputs to TensorSchema for better input validation. An edge case where an empty list of pixel_values could cause a server crash was identified and should be addressed.

gemini-code-assist · 2025-07-28T15:35:26Z

vllm/model_executor/models/llava.py

When pixel_values is an empty list, flatten_bn(pixel_values, concat=True) on line 597 will raise an exception because it calls torch.cat([]). This can crash the server if a request has an empty list for pixel_values. Add a check for an empty pixel_values list before this block to prevent this.

    if pixel_values:
        expected_h = expected_w = self.config.vision_config.image_size
        return LlavaImagePixelInputs(
            type="pixel_values",
            pixel_values=flatten_bn(pixel_values, concat=True),
            resolve_bindings={
                "h": expected_h,
                "w": expected_w
            },
        )

bbeckca · 2025-07-29

That's a valid concern, but I'd like to avoid introducing new enforcement as part of this migration. Happy to update if others feel it'd be helpful.
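For readers weighing the suggestion, here is a minimal torch-free sketch of the guard being discussed. `concat_batches` is a hypothetical stand-in for a concatenating flatten that, like `torch.cat`, rejects an empty input; `parse_pixel_values` and its return-`None` behavior are assumptions for illustration, not what the PR does.

```python
from typing import List, Optional, Sequence


def concat_batches(batches: Sequence[List[float]]) -> List[float]:
    """Stand-in for a concatenating flatten; an empty input is an error."""
    if not batches:
        raise RuntimeError("expected a non-empty list of batches")
    return [value for batch in batches for value in batch]


def parse_pixel_values(
        pixel_values: Optional[Sequence[List[float]]]) -> Optional[List[float]]:
    """Return flattened values, or None when there is nothing to process."""
    # Explicit None/length checks instead of relying on truthiness.
    if pixel_values is None or len(pixel_values) == 0:
        return None
    return concat_batches(pixel_values)
```

Whether an empty list should map to `None` or to a clearer request-level error is a design choice the thread leaves open. An explicit `len()` check may also be safer than `if pixel_values:` when the input can be a tensor, since truthiness of a multi-element `torch.Tensor` raises an error.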

bbeckca · 2025-07-28T15:38:25Z

vllm/model_executor/models/llava.py

Despite this comment about varying height/width, I'm adding enforcement to match existing behaviors in _validate_pixel_values. Please feel free to correct. @DarkLight1337 @Isotr0py

    def _validate_pixel_values(self, data: torch.Tensor) -> torch.Tensor:
        h = w = self.config.vision_config.image_size
        expected_dims = (3, h, w)
        actual_dims = tuple(data.shape[1:])

        if actual_dims != expected_dims:
            expected_expr = ("batch_size", *map(str, expected_dims))
            raise ValueError(
                f"The expected shape of pixel values is {expected_expr}. "
                f"You supplied {tuple(data.shape)}.")

        return data
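To make the legacy behavior concrete, the following is a torch-free rendition of the same check that takes a plain shape tuple. The function name `validate_pixel_shape` and the `image_size` of 336 used in the usage note are assumptions for illustration only; the logic mirrors the method quoted above.

```python
from typing import Tuple


def validate_pixel_shape(shape: Tuple[int, ...],
                         image_size: int) -> Tuple[int, ...]:
    """Same comparison as the quoted method, applied to a shape tuple."""
    h = w = image_size
    expected_dims = (3, h, w)
    # Ignore the leading batch dimension, as the original does.
    actual_dims = tuple(shape[1:])

    if actual_dims != expected_dims:
        expected_expr = ("batch_size", *map(str, expected_dims))
        raise ValueError(
            f"The expected shape of pixel values is {expected_expr}. "
            f"You supplied {tuple(shape)}.")

    return shape
```

Calling `validate_pixel_shape((1, 3, 224, 224), image_size=336)` raises `ValueError` with the message `The expected shape of pixel values is ('batch_size', '3', '336', '336'). You supplied (1, 3, 224, 224).`, which is the constraint the `resolve_bindings` for `h` and `w` are meant to preserve after the migration.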

bbeckca · 2025-07-28T15:40:00Z

vllm/model_executor/models/llava.py

No enforcement was previously applied, so I skipped adding validations against (num_channels, height, width). Feel free to let me know if there are other preferences. cc @DarkLight1337 @Isotr0py

DarkLight1337 · 2025-08-07T06:07:57Z

Can you merge from main? It should fix the CI

bbeckca · 2025-08-08T14:58:12Z

> Can you merge from main? It should fix the CI

Able to reproduce the main branch CI failures locally. These appear unrelated to this PR. Will rebase once the upstream issue is fixed.

DarkLight1337 · 2025-08-08T14:59:25Z

Retrying MM test

bbeckca · 2025-08-08T19:49:50Z

> Retrying MM test

Sorry I missed that. Will take a closer look.

bbeckca · 2025-08-09T15:49:13Z

Took a closer look, but the MM test failure also happens with the latest main. It seems related to downloading an image for Pixtral, so unrelated to these changes?

tests/models/multimodal/generation/test_pixtral.py:116: in <module>
    _create_engine_inputs(IMG_URLS),
tests/models/multimodal/generation/test_pixtral.py:76: in _create_engine_inputs
    tokenized = tokenizer.encode_chat_completion(request)
venv/lib/python3.9/site-packages/mistral_common/tokens/tokenizers/mistral.py:379: in encode_chat_completion
    return self.instruct_tokenizer.encode_instruct(instruct_request)
venv/lib/python3.9/site-packages/mistral_common/tokens/tokenizers/instruct.py:179: in encode_instruct
    new_tokens, new_images, new_audios = self.encode_user_message(
venv/lib/python3.9/site-packages/mistral_common/tokens/tokenizers/instruct.py:449: in encode_user_message
    tokens, image, audio = self.encode_user_content(
venv/lib/python3.9/site-packages/mistral_common/tokens/tokenizers/instruct.py:762: in encode_user_content
    chunk_tokens, chunk_image, _ = self._encode_content_chunk(chunk)
venv/lib/python3.9/site-packages/mistral_common/tokens/tokenizers/instruct.py:688: in _encode_content_chunk
    img_encoding = self.image_encoder(chunk)
venv/lib/python3.9/site-packages/mistral_common/tokens/tokenizers/image.py:224: in __call__
    image = image_from_chunk(content)
venv/lib/python3.9/site-packages/mistral_common/tokens/tokenizers/image.py:92: in image_from_chunk
    return download_image(chunk.get_url())
venv/lib/python3.9/site-packages/mistral_common/image.py:33: in download_image
    raise RuntimeError(f"Error downloading the image from {url}: {e}.")
E   RuntimeError: Error downloading the image from https://picsum.photos/id/27/500/500: 525 Server Error: <none> for url: https://picsum.photos/id/27/500/500.

DarkLight1337 · 2025-08-09T15:51:07Z

Retrying

gemini-code-assist bot reviewed Jul 28, 2025

bbeckca commented Jul 28, 2025

bbeckca force-pushed the llava branch from f0efe39 to 61608fa August 5, 2025 14:42

Isotr0py approved these changes Aug 5, 2025

Isotr0py enabled auto-merge (squash) August 5, 2025 16:47

github-actions bot added the "ready" label (ONLY add when PR is ready to merge/full CI is needed) Aug 5, 2025

auto-merge was automatically disabled August 6, 2025 15:07
Head branch was pushed to by a user without write access

bbeckca force-pushed the llava branch from 61608fa to e968d72 August 6, 2025 15:07

Isotr0py enabled auto-merge (squash) August 6, 2025 15:14

auto-merge was automatically disabled August 7, 2025 14:30
Head branch was pushed to by a user without write access

bbeckca force-pushed the llava branch from e968d72 to 629a433 August 7, 2025 14:30

Migrate LlavaImageInputs to TensorSchema

b9342ec

Signed-off-by: Benji Beck <benjibeck@meta.com>

bbeckca force-pushed the llava branch from 629a433 to b9342ec August 10, 2025 17:11

vllm-bot merged commit 06da44f into vllm-project:main Aug 11, 2025
35 of 43 checks passed

paulpak58 pushed a commit to paulpak58/vllm that referenced this pull request Aug 13, 2025

Migrate LlavaImageInputs to TensorSchema (vllm-project#21770)

fabe477

Signed-off-by: Benji Beck <benjibeck@meta.com> Signed-off-by: Paul Pak <paulpak58@gmail.com>

diegocastanibm pushed a commit to diegocastanibm/vllm that referenced this pull request Aug 15, 2025

Migrate LlavaImageInputs to TensorSchema (vllm-project#21770)

f8ae97b

Signed-off-by: Benji Beck <benjibeck@meta.com> Signed-off-by: Diego-Castan <diego.castan@ibm.com>

yiliu30 pushed a commit to yiliu30/vllm-fork that referenced this pull request Aug 19, 2025

Migrate LlavaImageInputs to TensorSchema (vllm-project#21770)

feef1c6

Signed-off-by: Benji Beck <benjibeck@meta.com>

epwalsh pushed a commit to epwalsh/vllm that referenced this pull request Aug 28, 2025

Migrate LlavaImageInputs to TensorSchema (vllm-project#21770)

fc8f298

Signed-off-by: Benji Beck <benjibeck@meta.com>

xiao-llm pushed a commit to xiao-llm/vllm that referenced this pull request Aug 28, 2025

Migrate LlavaImageInputs to TensorSchema (vllm-project#21770)

e0e5442

Signed-off-by: Benji Beck <benjibeck@meta.com> Signed-off-by: Xiao Yu <xiao.yu@amd.com>

zhewenl pushed a commit to zhewenl/vllm that referenced this pull request Aug 28, 2025

Migrate LlavaImageInputs to TensorSchema (vllm-project#21770)

30b96cc

Signed-off-by: Benji Beck <benjibeck@meta.com>
