-
-
Notifications
You must be signed in to change notification settings - Fork 7.7k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[doc] fix the list rendering issue - security.md
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#18982
opened May 31, 2025 by
reidliu41
Loading…
[BugFix] Fix multi-node offline data-parallel
bug
Something isn't working
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#18981
opened May 31, 2025 by
njhill
Loading…
fix security issue of logging llm output
frontend
ready
ONLY add when PR is ready to merge/full CI is needed
tool-calling
#18980
opened May 31, 2025 by
luccafong
Loading…
[Core] Remove unnecessary copy of multi modal input embeddings
v1
#18973
opened May 30, 2025 by
lgeiger
Loading…
[V1][Spec Decode][Ngram] 1.35x gain -> 1.95x gain on InstructCoder with prompt fix
#18971
opened May 30, 2025 by
ekagra-ranjan
Loading…
[P/D] NixlConnector use cache device index for memory registration
ready
ONLY add when PR is ready to merge/full CI is needed
#18969
opened May 30, 2025 by
ptarasiewiczNV
Loading…
Let max_num_batched_tokens use human_readable_int for large numbers
#18968
opened May 30, 2025 by
mgoin
Loading…
[Bugfix][core] Prefix caching enabled causes incorrect outputs
#18957
opened May 30, 2025 by
quanliu1991
Loading…
Abstract mooncake store connector to kv store connector
#18936
opened May 30, 2025 by
maobaolong
Loading…
[Misc] reuse num_tokens_across_dp of get_dp_padding to avoid unnecessary dp all reduce in set_forward_context
v1
#18935
opened May 30, 2025 by
izhuhaoran
Loading…
Adding "LoRA Test %N" to AMD production tests
ci/build
rocm
Related to AMD ROCm
#18929
opened May 29, 2025 by
Concurrensee
Loading…
feat: add data parallel rank to KVEventBatch
documentation
Improvements or additions to documentation
v1
#18925
opened May 29, 2025 by
PeaBrane
Loading…
[Neuron] Add Multi-Modal model support for Neuron
documentation
Improvements or additions to documentation
ready
ONLY add when PR is ready to merge/full CI is needed
#18921
opened May 29, 2025 by
aws-satyajith
Loading…
[Core] Remove int32->int64->int32 overhead in FlashInfer sampling
v1
#18920
opened May 29, 2025 by
lgeiger
Loading…
[Misc] Fix path and python alias errors in disagg_prefill exmaples
documentation
Improvements or additions to documentation
#18919
opened May 29, 2025 by
Jeffwan
Loading…
update the arch list for Blackwell support on nightly dockerfile
ci/build
#18912
opened May 29, 2025 by
kushanam
Loading…
[BugFix] Pydantic part 2
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#18911
opened May 29, 2025 by
ProExpertProg
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.