Bump Flashinfer to 0.3.1 #24868
Conversation
Code Review
This pull request bumps the FlashInfer version to 0.3.1 in the Dockerfile. However, the corresponding dependency in setup.py has not been updated, which will lead to inconsistent versions being installed depending on the environment. This is a critical issue that needs to be addressed.
docker/Dockerfile
The comment on the preceding line indicates that this version should be synchronized with the flashinfer extra in setup.py. While this file is updated to v0.3.1, setup.py still specifies flashinfer-python==0.3.0. This discrepancy will cause different versions of FlashInfer to be installed depending on whether the project is built using Docker or installed via pip, which can lead to hard-to-debug issues. Please update setup.py to use version 0.3.1 as well.
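As an illustration of how this kind of drift could be caught automatically, here is a minimal sketch (not part of this PR; the file paths and version-pin regexes are assumptions) that compares the FlashInfer pin in docker/Dockerfile against the flashinfer-python pin in setup.py:

```python
# Hypothetical consistency check (not part of this PR): compare the FlashInfer
# version pinned in docker/Dockerfile with the one pinned in setup.py and fail
# if they diverge. The regexes assume pins of the form "v0.3.1" and
# "flashinfer-python==0.3.1".
import re
import sys
from pathlib import Path


def dockerfile_version(path: str = "docker/Dockerfile") -> str | None:
    text = Path(path).read_text()
    match = re.search(r"flashinfer.*?v?(\d+\.\d+\.\d+)", text, re.IGNORECASE)
    return match.group(1) if match else None


def setup_py_version(path: str = "setup.py") -> str | None:
    text = Path(path).read_text()
    match = re.search(r"flashinfer-python==(\d+\.\d+\.\d+)", text)
    return match.group(1) if match else None


if __name__ == "__main__":
    docker_ver, setup_ver = dockerfile_version(), setup_py_version()
    if docker_ver != setup_ver:
        sys.exit(f"FlashInfer pin mismatch: Dockerfile={docker_ver}, setup.py={setup_ver}")
    print(f"FlashInfer pins are consistent: {docker_ver}")
```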
@bbartels Could you please add to the description the reason for the update?
Done @mgoin |
Passed the Blackwell test, LGTM!
Purpose
Bumps FlashInfer to v0.3.1. This version includes fixes for certain code paths not being AOT-compiled.
Test Plan
Run CI checks
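In addition to CI, a quick local sanity check (a sketch, assuming the wheel is installed under the flashinfer-python distribution name mentioned above) could confirm that the expected version is picked up:

```python
# Illustrative local check (not part of the PR's test plan): confirm the
# installed flashinfer-python wheel reports the expected version.
from importlib.metadata import version

assert version("flashinfer-python") == "0.3.1", "unexpected FlashInfer version"
print("flashinfer-python", version("flashinfer-python"))
```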
Test Result
CI checks passed
Essential Elements of an Effective PR Description Checklist
Update supported_models.md and examples for a new model.