[Bugfix] Set `VLLM_ALLREDUCE_USE_SYMM_MEM` default to False #24696

yewentao256 · 2025-09-11T21:26:35Z

Purpose

#24111 should be tested with DP case then open to default again

Test

(APIServer pid=454020) INFO:     Started server process [454020]
(APIServer pid=454020) INFO:     Waiting for application startup.
(APIServer pid=454020) INFO:     Application startup complete.

Signed-off-by: yewentao256 <zhyanwentao@126.com>

mergify · 2025-09-11T21:27:12Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @yewentao256.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

gemini-code-assist

Code Review

This pull request correctly disables the VLLM_ALLREDUCE_USE_SYMM_MEM feature by default by changing its environment variable's default value from True to False. This is a sensible approach to temporarily mitigate a bug as described in the PR. The changes are consistently applied, and the existing tests for this feature are correctly configured to explicitly enable it, ensuring continued test coverage. The implementation looks good.

…-default Signed-off-by: yewentao256 <zhyanwentao@126.com>

mgoin

Thanks, let's get this bugfix in for now to fix release

…ject#24696) Signed-off-by: yewentao256 <zhyanwentao@126.com>

nvpohanh · 2025-09-16T06:21:33Z

@ilmarkov Could you fix the issue and re-enable VLLM_ALLREDUCE_USE_SYMM_MEM by default so that we can benefit from the faster AllReduce without any env vars? Thanks!

ilmarkov · 2025-09-16T07:44:14Z

@nvpohanh Yes, I am working on this. The easiest solution would be disable symm mem when DP is used (i.e. all devices do only TP or PP) but I am trying to find a way to enable it for all. The problem is torch incorrectly detects overlapping devices here in case when multiple DP processes are running on the same node.

…ject#24696) Signed-off-by: yewentao256 <zhyanwentao@126.com>

…ject#24696) Signed-off-by: yewentao256 <zhyanwentao@126.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

Set VLLM_ALLREDUCE_USE_SYMM_MEM default to False

c9129b0

Signed-off-by: yewentao256 <zhyanwentao@126.com>

mergify bot added the needs-rebase label Sep 11, 2025

gemini-code-assist bot reviewed Sep 11, 2025

View reviewed changes

Merge branch 'main' into wye-set-VLLM_ALLREDUCE_USE_SYMM_MEM-to-false…

ae18f9c

…-default Signed-off-by: yewentao256 <zhyanwentao@126.com>

mgoin approved these changes Sep 11, 2025

View reviewed changes

mgoin changed the title ~~Set VLLM_ALLREDUCE_USE_SYMM_MEM default to False~~ [Bugfix] Set VLLM_ALLREDUCE_USE_SYMM_MEM default to False Sep 11, 2025

mgoin added the bug Something isn't working label Sep 11, 2025

mergify bot removed the needs-rebase label Sep 11, 2025

simon-mo merged commit 1ec2035 into vllm-project:main Sep 11, 2025
5 of 9 checks passed

yewentao256 deleted the wye-set-VLLM_ALLREDUCE_USE_SYMM_MEM-to-false-default branch September 11, 2025 21:33

skyloevil pushed a commit to skyloevil/vllm that referenced this pull request Sep 13, 2025

[Bugfix] Set VLLM_ALLREDUCE_USE_SYMM_MEM default to False (vllm-pro…

806e5c7

…ject#24696) Signed-off-by: yewentao256 <zhyanwentao@126.com>

dsxsteven pushed a commit to dsxsteven/vllm_splitPR that referenced this pull request Sep 15, 2025

[Bugfix] Set VLLM_ALLREDUCE_USE_SYMM_MEM default to False (vllm-pro…

2ae3109

…ject#24696) Signed-off-by: yewentao256 <zhyanwentao@126.com>

FeiDaLI pushed a commit to FeiDaLI/vllm that referenced this pull request Sep 25, 2025

[Bugfix] Set VLLM_ALLREDUCE_USE_SYMM_MEM default to False (vllm-pro…

8b64773

…ject#24696) Signed-off-by: yewentao256 <zhyanwentao@126.com>

hiyouga mentioned this pull request Oct 4, 2025

[model] add qwen3vl hiyouga/EasyR1#520

Merged

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025

[Bugfix] Set VLLM_ALLREDUCE_USE_SYMM_MEM default to False (vllm-pro…

0f6e196

…ject#24696) Signed-off-by: yewentao256 <zhyanwentao@126.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025

[Bugfix] Set VLLM_ALLREDUCE_USE_SYMM_MEM default to False (vllm-pro…

04def0e

…ject#24696) Signed-off-by: yewentao256 <zhyanwentao@126.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bugfix] Set `VLLM_ALLREDUCE_USE_SYMM_MEM` default to False #24696

[Bugfix] Set `VLLM_ALLREDUCE_USE_SYMM_MEM` default to False #24696

Uh oh!

yewentao256 commented Sep 11, 2025 •

edited by github-actions bot

Loading

Uh oh!

mergify bot commented Sep 11, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

mgoin left a comment

Uh oh!

Uh oh!

nvpohanh commented Sep 16, 2025

Uh oh!

ilmarkov commented Sep 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

[Bugfix] Set VLLM_ALLREDUCE_USE_SYMM_MEM default to False #24696

[Bugfix] Set VLLM_ALLREDUCE_USE_SYMM_MEM default to False #24696

Uh oh!

Conversation

yewentao256 commented Sep 11, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test

Uh oh!

mergify bot commented Sep 11, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

mgoin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

nvpohanh commented Sep 16, 2025

Uh oh!

ilmarkov commented Sep 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[Bugfix] Set `VLLM_ALLREDUCE_USE_SYMM_MEM` default to False #24696

[Bugfix] Set `VLLM_ALLREDUCE_USE_SYMM_MEM` default to False #24696

yewentao256 commented Sep 11, 2025 •

edited by github-actions bot

Loading