Pull requests: ModelTC/LightLLM


Pull requests list

add launch_server and StartArgs
#1119 opened Nov 19, 2025 by sufubao
Support Qwen3next and prefix cache.
#1115 opened Nov 18, 2025 by sufubao
[Feat] Add structured generation OpenAI API
#1114 opened Nov 18, 2025 by flyinglandlord
add flashinfer fused_norm_add op
#1105 opened Nov 13, 2025 by SangChengC
support Deepseek3.2
#1103 opened Nov 10, 2025 by sufubao
add flashinfer-trtllm-ragged-prefill-attn
#1099 opened Nov 6, 2025 by SangChengC
feat: disk cache v1.0
#1098 opened Nov 5, 2025 by blueswhen
Add qwen3 vl
#1095 opened Nov 4, 2025 by SangChengC
fix: MTP in chunked prefill mode
#1079 opened Oct 14, 2025 by sufubao
support interns1
#1060 opened Sep 18, 2025 by xhx1022
enable fa3 and fused_shared_experts by default
#1053 opened Sep 15, 2025 by sufubao
fix tl.where and set the default loader worker number
#1052 opened Sep 11, 2025 by sufubao
feat: Implementing Past Future Scheduler
#1048 opened Sep 8, 2025 by WuSiYu
[support] vit and llm disaggregation
#1014 opened Aug 20, 2025 by SangChengC
add fa3_mtp
#1005 opened Aug 11, 2025 by WANDY666
Support Qwen models' dp>1 in PD
#999 opened Aug 5, 2025 by zhhangBian
add rmsnorm-add fusion kernel
#996 opened Aug 4, 2025 by theNiemand
Asynchicache
#977 opened Jul 21, 2025 by jinbiaoyu
Fp8 deepseek
#975 opened Jul 17, 2025 by blueswhen
cuda graph pool with LRU
#964 opened Jul 8, 2025 by STwangyingrui
Add fake balance for EP mode
#962 opened Jul 8, 2025 by STwangyingrui
Multimodal improve
#951 opened Jul 1, 2025 by shihaobai
feat: Support decode chunk PD serving mode
#944 opened Jun 25, 2025 by zhhangBian