[Misc] Refactor platform to get device specific stream and event #14411

shen-shanshan · 2025-03-07T06:32:43Z

What does this PR do?

Add a __getitem__ method for Platform class.

When using cuda, current_platform['xxx'] is equal to torch.cuda.xxx, whereas when using npu, current_platform['xxx'] is equal to torch.npu.xxx. Thus, we do not need to abstract Stream or Event anymore.

github-actions · 2025-03-07T06:32:53Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

Selkh · 2025-03-15T01:52:21Z

There 're also some "torch.cuda.Stream"s in vllm/distributed/parallel_state.py:graph_capture

youkaichao

we don't create abstractions until many platforms need this.

mergify · 2025-04-14T12:00:53Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @shen-shanshan.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

vllm/spec_decode/metrics.py

Signed-off-by: shen-shanshan <467638484@qq.com>

shen-shanshan · 2025-04-15T02:49:32Z

@youkaichao I have updated a new version for this refactor. By directly adding __getattr__() method to Platform, all kinds of hardware can get their Stream, Event or other attribute without modification to vllm. I think this can make vllm more extensible. 😄

youkaichao

okay, this looks better.

…m-project#14411) Signed-off-by: shen-shanshan <467638484@qq.com> Signed-off-by: Yang Wang <elainewy@meta.com>

### What this PR does / why we need it? Remove some parts of metrics patch, since the `cuda` hard code has been fixed by vllm-project/vllm#14411. Signed-off-by: shen-shanshan <467638484@qq.com>

…m-project#14411) Signed-off-by: shen-shanshan <467638484@qq.com> Signed-off-by: Frieda (Jingying) Huang <jingyingfhuang@gmail.com>

### What this PR does / why we need it? Remove some parts of metrics patch, since the `cuda` hard code has been fixed by vllm-project/vllm#14411. Signed-off-by: shen-shanshan <467638484@qq.com>

…m-project#14411) Signed-off-by: shen-shanshan <467638484@qq.com>

…m-project#14411) Signed-off-by: shen-shanshan <467638484@qq.com> Signed-off-by: Agata Dobrzyniewicz <adobrzyniewicz@habana.ai>

…m-project#14411) Signed-off-by: shen-shanshan <467638484@qq.com> Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>

### What this PR does / why we need it? Remove some parts of metrics patch, since the `cuda` hard code has been fixed by vllm-project/vllm#14411. Signed-off-by: shen-shanshan <467638484@qq.com>

shen-shanshan changed the title ~~[Misc] format~~ [Misc] Add get_stream_cls() method for Platform class Mar 7, 2025

mergify bot added the speculative-decoding label Mar 7, 2025

shen-shanshan mentioned this pull request Mar 7, 2025

[Platform] Refactor platform to get device specific stream and event vllm-project/vllm-ascend#261

Closed

wangxiyuan mentioned this pull request Mar 7, 2025

[RFC]: Hardware pluggable #11162

Closed

1 task

shen-shanshan force-pushed the patch-2 branch from 846f17a to 434e67a Compare March 19, 2025 09:16

shen-shanshan changed the title ~~[Misc] Add get_stream_cls() method for Platform class~~ [Misc] Refactor platform to get device specific stream and event Mar 19, 2025

youkaichao reviewed Mar 22, 2025

View reviewed changes

shen-shanshan force-pushed the patch-2 branch from 434e67a to 9e3d5df Compare March 26, 2025 06:28

shen-shanshan closed this Apr 14, 2025

shen-shanshan reopened this Apr 14, 2025

mergify bot added the needs-rebase label Apr 14, 2025

wangxiyuan reviewed Apr 14, 2025

View reviewed changes

vllm/spec_decode/metrics.py Outdated Show resolved Hide resolved

update

0386180

Signed-off-by: shen-shanshan <467638484@qq.com>

shen-shanshan force-pushed the patch-2 branch from 9e3d5df to 0386180 Compare April 15, 2025 02:43

mergify bot removed the needs-rebase label Apr 15, 2025

youkaichao approved these changes Apr 21, 2025

View reviewed changes

youkaichao merged commit 7272bfa into vllm-project:main Apr 21, 2025
21 checks passed

yangw-dev pushed a commit to yangw-dev/vllm that referenced this pull request Apr 21, 2025

[Misc] Refactor platform to get device specific stream and event (vll…

6cd3ec6

…m-project#14411) Signed-off-by: shen-shanshan <467638484@qq.com> Signed-off-by: Yang Wang <elainewy@meta.com>

shen-shanshan mentioned this pull request Apr 22, 2025

[Misc] Remove some parts of metrics patch vllm-project/vllm-ascend#603

Merged

shen-shanshan mentioned this pull request Apr 22, 2025

[Misc] Replace cuda hard code with current_platform #16983

Merged

jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Apr 29, 2025

[Misc] Refactor platform to get device specific stream and event (vll…

bd02fff

…m-project#14411) Signed-off-by: shen-shanshan <467638484@qq.com>

lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Apr 29, 2025

[Misc] Refactor platform to get device specific stream and event (vll…

5fc6c04

…m-project#14411) Signed-off-by: shen-shanshan <467638484@qq.com>

ckhordiasma mentioned this pull request May 14, 2025

nm vllm ent 0.8.5 sync red-hat-data-services/vllm#139

Merged

aarnphm mentioned this pull request May 28, 2025

[Bugfix] Fix spec decode on non-cuda platforms #18501

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Misc] Refactor platform to get device specific stream and event #14411

[Misc] Refactor platform to get device specific stream and event #14411

Uh oh!

shen-shanshan commented Mar 7, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Mar 7, 2025

Uh oh!

Selkh commented Mar 15, 2025

Uh oh!

youkaichao left a comment

Uh oh!

mergify bot commented Apr 14, 2025

Uh oh!

Uh oh!

shen-shanshan commented Apr 15, 2025

Uh oh!

youkaichao left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

[Misc] Refactor platform to get device specific stream and event #14411

[Misc] Refactor platform to get device specific stream and event #14411

Uh oh!

Conversation

shen-shanshan commented Mar 7, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

github-actions bot commented Mar 7, 2025

Uh oh!

Selkh commented Mar 15, 2025

Uh oh!

youkaichao left a comment

Choose a reason for hiding this comment

Uh oh!

mergify bot commented Apr 14, 2025

Uh oh!

Uh oh!

shen-shanshan commented Apr 15, 2025

Uh oh!

youkaichao left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

shen-shanshan commented Mar 7, 2025 •

edited by github-actions bot

Loading