Fix local development for vllm==0.8.5 #140
Conversation
Yep, I can confirm this works.
I'm also curious to know what changed in 0.8.5 such that this Triton placeholder started behaving incorrectly. I'm assuming this placeholder existed in earlier versions too, right?

Edit: I think I found the answer: https://github.com/vllm-project/vllm/pull/15099/files. So the placeholder is new in 0.8.5.
Yeah, this is the issue. I think a good contribution back upstream would be either a full-fledged placeholder where every import works (for instance, being able to import a dummy config), or an environment variable that prevents the placeholder from being installed, given the side effects it can have on PyTorch. For cases like ours, and others, it would be better to raise an exception than to get weird behavior in other dependencies, as happens with PyTorch itself.
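For illustration, here is a minimal sketch of that upstream suggestion; the environment variable name `VLLM_DISABLE_TRITON_PLACEHOLDER` and the placeholder's exact shape are assumptions for this sketch, not vLLM's actual API:

```python
import importlib.util
import os
import sys
import types


class _TritonPlaceholder(types.ModuleType):
    """Hypothetical stand-in for triton that fails loudly on any attribute
    access instead of silently misbehaving in downstream libraries."""

    def __getattr__(self, name: str):
        raise ImportError(
            f"triton is not installed, so 'triton.{name}' is unavailable"
        )


# Install the placeholder only when triton is truly absent and the user
# has not opted out via the (hypothetical) environment variable.
if importlib.util.find_spec("triton") is None and not os.environ.get(
    "VLLM_DISABLE_TRITON_PLACEHOLDER"
):
    sys.modules["triton"] = _TritonPlaceholder("triton")
```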
The problem
The problem started happening when I pulled new code for vllm-spyre, which includes updates for vllm. I got this stack trace:
With newer vllm versions (>=0.8.5), vLLM pokes at the loaded modules and creates a placeholder for triton when it is not installed. This workaround can make PyTorch mess up some code related to torch.compile.
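For context, PyTorch decides whether Triton is available roughly like this (a paraphrase of the kind of check it performs, not the exact source), which is why a placeholder registered in `sys.modules` is enough to fool it:

```python
# Paraphrased availability check (not PyTorch's exact source): triton is
# considered present if it can merely be imported, so a fake "triton"
# module in sys.modules makes torch.compile assume real kernels exist.
def has_triton_package() -> bool:
    try:
        import triton  # succeeds even when "triton" is only a placeholder
        return triton is not None
    except ImportError:
        return False
```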
My solution is to just delete the triton placeholder module, and everything works again using the Dynamo backend as eager in case we are developing on macOS.
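The fix boils down to something like the sketch below; the `__spec__` heuristic for telling the placeholder apart from a real triton install is an assumption for this sketch, not necessarily the exact test used in this PR:

```python
import sys

import torch


def drop_triton_placeholder() -> None:
    # A module synthesized via types.ModuleType has __spec__ set to None,
    # while a genuinely imported triton carries a real ModuleSpec. This
    # heuristic is an assumption, not necessarily the PR's exact check.
    mod = sys.modules.get("triton")
    if mod is not None and getattr(mod, "__spec__", None) is None:
        del sys.modules["triton"]


drop_triton_placeholder()

# Without triton (e.g. on macOS), fall back to the eager Dynamo backend so
# torch.compile skips kernel generation entirely.
compiled = torch.compile(lambda x: x * 2, backend="eager")
print(compiled(torch.ones(3)))
```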
PR for triton placeholder for reference: vllm-project/vllm#15099