Skip to content

Conversation

@hmellor
Copy link
Member

@hmellor hmellor commented Oct 6, 2025

  • Adds the MixtureOfExperts mixin to TransformersMoEBase
  • Adds necessary methods and sets the necessary state for this mixin to function

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
@hmellor hmellor requested a review from Isotr0py October 6, 2025 09:23
@hmellor hmellor moved this to In Progress in Transformers backend Oct 6, 2025
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request enables expert parallel load balancing (EPLB) for Mixture-of-Experts models using the Transformers backend. This is achieved by implementing the MixtureOfExperts interface in TransformersMoEBase and adding the necessary state and methods for EPLB to function. The changes also include some nice refactoring in transformers.py to consistently use process group objects.

I've found one critical issue in the implementation of update_physical_experts_metadata that would prevent dynamic load balancing from working correctly. Please see my specific comment for details.

Copy link
Member

@Isotr0py Isotr0py left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@hmellor hmellor enabled auto-merge (squash) October 6, 2025 09:35
@github-actions github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 6, 2025
@hmellor hmellor merged commit 0340f45 into vllm-project:main Oct 6, 2025
56 checks passed
@hmellor hmellor deleted the transformers-backend-eplb branch October 6, 2025 11:20
@github-project-automation github-project-automation bot moved this from In Progress to Done in Transformers backend Oct 6, 2025
karan pushed a commit to karan/vllm that referenced this pull request Oct 6, 2025
…project#26287)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Karan Goel <3261985+karan@users.noreply.github.com>
southfreebird pushed a commit to southfreebird/vllm that referenced this pull request Oct 7, 2025
…project#26287)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025
…project#26287)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025
…project#26287)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
alhridoy pushed a commit to alhridoy/vllm that referenced this pull request Oct 24, 2025
…project#26287)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025
…project#26287)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
rtourgeman pushed a commit to rtourgeman/vllm that referenced this pull request Nov 10, 2025
…project#26287)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready ONLY add when PR is ready to merge/full CI is needed

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

2 participants