-
-
Notifications
You must be signed in to change notification settings - Fork 11.4k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Enable dumping expert input tensors for GPT-OSS
gpt-oss
Related to GPT-OSS models
#29050
opened Nov 20, 2025 by
mutinifni
Loading…
docs: cleanup TPU documentation and remove outdated examples
documentation
Improvements or additions to documentation
tpu
Related to Google TPUs
#29048
opened Nov 19, 2025 by
RobMulla
Loading…
4 of 5 tasks
Online Rotations to vLLM
ci/build
documentation
Improvements or additions to documentation
llama
Related to Llama models
needs-rebase
rocm
Related to AMD ROCm
#29047
opened Nov 19, 2025 by
gametekker
Loading…
[CI/Build][AMD] Skip if flash_attn_varlen_func not available in test_aiter_flash_attn.py
rocm
Related to AMD ROCm
#29043
opened Nov 19, 2025 by
rasmith
Loading…
fix video frame check error
multi-modality
Related to multi-modality (#4194)
#29041
opened Nov 19, 2025 by
chengyinie
•
Draft
5 tasks
[DeepSeek + LMCache Multiprocess] handle MLA for deepseek model + LMCache Multiprocess connector
deepseek
Related to DeepSeek models
kv-connector
ready
ONLY add when PR is ready to merge/full CI is needed
#29039
opened Nov 19, 2025 by
KuntaiDu
Loading…
5 tasks
[Bug] Fix torch dynamo warning Dynamo detected a call to a
functools.lru_cache
v1
#29038
opened Nov 19, 2025 by
yewentao256
Loading…
[CI] Fix mypy for Related to Google TPUs
v1
vllm/v1/worker
tpu
#29037
opened Nov 19, 2025 by
yewentao256
Loading…
[Core] Replace LogprobsLists with multiple numpy arrays in EngineCoreOutput
v1
#29035
opened Nov 19, 2025 by
Jialin
Loading…
3 of 5 tasks
[Frontend] Support for direct url passing to opencv for video
documentation
Improvements or additions to documentation
multi-modality
Related to multi-modality (#4194)
[Core] Avoid list[int] in EngineCoreOutput for GC efficiency
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#29033
opened Nov 19, 2025 by
Jialin
Loading…
3 of 5 tasks
[Core] DO NOT trim spec token lists for future reuse
v1
#29031
opened Nov 19, 2025 by
Jialin
Loading…
3 of 5 tasks
Remove redundant copy in TopKWeightAndReduceNoOP
#29028
opened Nov 19, 2025 by
xyang16
Loading…
5 tasks
Fix boolean nested params, add dict format support, and enhance plotting for vllm bench sweep
performance
Performance-related issues
[Small] Capture AttributeError when checking ray dependency.
#29024
opened Nov 19, 2025 by
huachenheli
Loading…
[CI/Build] Remove skip global cleanup in test_struct_output_generate.py
structured-output
v1
#29022
opened Nov 19, 2025 by
rasmith
Loading…
[CI/Build] Skip lm-format-enforcer tests in test_struct_output_generate.py for now
structured-output
v1
#29021
opened Nov 19, 2025 by
rasmith
Loading…
[Kernel] Separate Triton Attention Kernel Launches for Prefill and Decode for FULL CUDA Graph mode
nvidia
v1
#29020
opened Nov 19, 2025 by
jvlunteren
Loading…
[CI-Build] Fixed numeric issue of test_prefix_prefill on AMD
needs-rebase
rocm
Related to AMD ROCm
#29019
opened Nov 19, 2025 by
him-rh-nm
Loading…
3 of 5 tasks
Updating the mirror of test-amd.yaml as of 2025-11-18
ci/build
needs-rebase
rocm
Related to AMD ROCm
#29016
opened Nov 19, 2025 by
Alexei-V-Ivanov-AMD
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.