-
-
Notifications
You must be signed in to change notification settings - Fork 11.1k
[V0 Deprecation] Enable the remaining multimodal tests in V1 #25307
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Code Review
This pull request addresses a hanging issue in the AWQ test on vLLM's V1 engine by limiting the number of PyTorch threads to one during model initialization. The approach is sound and correctly resolves the likely cause of the hang in a multiprocessing context. The changes also correctly re-enable the test for V1 by removing the environment variable override. I've added one comment to refactor the duplicated code blocks for running the source and quantized models into a helper function. This will improve the code's maintainability and readability.
|
Actually let me edit this PR to migrate more tests |
So the hanging issue also appears on tests just migrated to V1 recently? Perhaps there are some issues about processes forking... |
|
Yes, it seems this context is only necessary for V1. |
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
|
cc @WoosukKwon after this PR, multimodal models tests should be fully V1 now |
WoosukKwon
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for doing this!
|
The |
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
…oject#25307) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
…oject#25307) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: charlifu <charlifu@amd.com>
Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: yewentao256 <zhyanwentao@126.com>
…oject#25307) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: xuebwang-amd <xuebwang@amd.com>
…oject#25307) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
…oject#25307) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk>
…oject#25307) Signed-off-by: DarkLight1337 <tlleungac@connect.ust.hk> Signed-off-by: xuebwang-amd <xuebwang@amd.com>
Purpose
Use
set_default_torch_num_threadsto avoid hanging issueTest Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.mdandexamplesfor a new model.