-
-
Notifications
You must be signed in to change notification settings - Fork 9.1k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Misc] Modify the organization of GLM series
documentation
Improvements or additions to documentation
multi-modality
Related to multi-modality (#4194)
ready
ONLY add when PR is ready to merge/full CI is needed
#22171
opened Aug 4, 2025 by
jeejeelee
Loading…
4 tasks
[Bugfix] Fix erroneous randomly generated cases in bad word testing
v1
#22170
opened Aug 4, 2025 by
phantomlei3
Loading…
3 of 4 tasks
[Model][V1] Support Ernie MTP
new-model
Requests to new models
speculative-decoding
v1
#22169
opened Aug 4, 2025 by
xyxinyang
Loading…
3 of 4 tasks
[Bugfix] Support full cuda graph with sliding window attention
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#22168
opened Aug 4, 2025 by
WoosukKwon
Loading…
[Bugfix] EPLB load statistics problem
#22167
opened Aug 4, 2025 by
david6666666
Loading…
3 of 4 tasks
[Docs] Update features/disagg_prefill, add v1 examples and development
documentation
Improvements or additions to documentation
#22165
opened Aug 4, 2025 by
david6666666
Loading…
2 of 4 tasks
[Misc] Minor fixes and cleanups for elastic EP
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#22160
opened Aug 4, 2025 by
ruisearch42
Loading…
4 tasks
[Core] Prepare Scheduler, EngineCore, and EngineCoreProc to depend on tpu_commons without circular dependency loop.
v1
#22155
opened Aug 3, 2025 by
yarongmu-google
Loading…
[Bugfix] Fix NemotronH d_inner calculation to prevent incorrect num_heads derivation
#22150
opened Aug 3, 2025 by
danielafrimi
Loading…
fix: kimi_k2 return empty tool call list
frontend
tool-calling
#22149
opened Aug 3, 2025 by
tlipoca9
Loading…
[Perf] add torch sparse mm benchmark
performance
Performance-related issues
#22148
opened Aug 3, 2025 by
yiakwy-xpu-ml-framework-team
Loading…
3 of 4 tasks
[Misc] log more detailed message for ensure_model_parallel_initialized
#22144
opened Aug 3, 2025 by
andyxning
Loading…
4 tasks
[Doc] add backend to doc string of initialize_model_parallel
#22142
opened Aug 3, 2025 by
andyxning
Loading…
4 tasks
[CI/Build] Update causal-conv1d and lm-eval
ci/build
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
rocm
Related to AMD ROCm
#22141
opened Aug 3, 2025 by
DarkLight1337
Loading…
1 of 4 tasks
Remove multi-step scheduling
ci/build
codex
performance
Performance-related issues
ready
ONLY add when PR is ready to merge/full CI is needed
rocm
Related to AMD ROCm
tpu
Related to Google TPUs
v1
#22138
opened Aug 3, 2025 by
WoosukKwon
Loading…
Update rms_norm_kernel by removing redundant global memory loads
#22134
opened Aug 3, 2025 by
bbeckca
Loading…
[Kernel] Add support for block FP8 on SM120 (NVIDIA 5090 and RTX PRO 6000)
ci/build
#22131
opened Aug 2, 2025 by
0xjunhao
Loading…
4 tasks
Use UV_LINK_MODE=copy in Dockerfile to avoid hardlink fail
ci/build
#22128
opened Aug 2, 2025 by
mgoin
Loading…
[Bugfix] Add num_special_tokens_to_add to MistralTokenizer, fixes #22013
#22121
opened Aug 2, 2025 by
ShUl0w
Loading…
3 of 4 tasks
[WIP] vLLM Benchmark suite improvement
ci/build
performance
Performance-related issues
#22119
opened Aug 2, 2025 by
louie-tsai
Loading…
1 of 4 tasks
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.