-
Notifications
You must be signed in to change notification settings - Fork 1.6k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
bug: [https://nvbugs/5368507] Fix test_generate_with_seed.
#6206
opened Jul 20, 2025 by
bobboli
Loading…
[https://nvbugs/5378031] Hopper W4A8 MoE supports ModelOpt ckpt for PyT backend
#6200
opened Jul 20, 2025 by
rosenrodt
Loading…
[https://nvbugs/5361178][fix]: Json schema support in trtllm-serve using xgrammar
Community want to contribute
PRs initiated from Community
#6197
opened Jul 18, 2025 by
mayani-nv
Loading…
fix: Allreduce Strategy is not correctly set for MNNVL fallback.
#6194
opened Jul 18, 2025 by
timlee0212
Loading…
fix: Ensure that Python stub generation works against libnvidia-ml stubs
#6188
opened Jul 18, 2025 by
MartinMarciniszyn
Loading…
[fix] Correct the returned value of has_spec_drafter
#6178
opened Jul 18, 2025 by
ziyixiong-nv
Loading…
[Perf]: Add residual, norm and AR fusions for llama and nemotron_nas models
#6157
opened Jul 17, 2025 by
NVShreyas
Loading…
[feat] Enable TP and batching for PixtralVisionModel / Mistral3VLM
#6152
opened Jul 17, 2025 by
2ez4bz
Loading…
[TRTLLM-6537][infra] extend multi-gpu tests related file list
#6139
opened Jul 17, 2025 by
reasonsolo
Loading…
[nvbug/5322354] fix PD + MTP + overlap scheduler accuracy issue
#6136
opened Jul 17, 2025 by
yweng0828
Loading…
[TRTLLM-6549] chore: record delay introduced by disaggregated serving in kv cache measure
#6135
opened Jul 17, 2025 by
zhengd-nv
Loading…
Previous Next
ProTip!
Adding no:label will show everything without a label.