-
Notifications
You must be signed in to change notification settings - Fork 190
Pull requests: NVIDIA/TensorRT-Model-Optimizer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
modelopt nas search() implementation for the compress algorithm
#490
opened Oct 31, 2025 by
danielkorzekwa
Loading…
fix qdq utils issues and remove global cast replacements
#489
opened Oct 31, 2025 by
nvluxiaoz
Loading…
[Draft] [5526696] Add kv cache quantization support for onnx quantization
#486
opened Oct 31, 2025 by
zhanghaoc
Loading…
[OMNIML-2917] export layer config using actual prefix instead of hard…
#479
opened Oct 28, 2025 by
shengliangxu
Loading…
[5590225] Fixed regression introduced by PR #364 (FP64-to-FP32 conversion)
#462
opened Oct 24, 2025 by
gcunhase
Loading…
Add functional test cases for published checkpoints on HF
#455
opened Oct 21, 2025 by
noeyy-mino
Loading…
Preserve original rope scaling type in export due to transformers library AutoConfig issue
#452
opened Oct 17, 2025 by
Edwardf0t1
Loading…
[1/2] Registry interface for custom quantization functional backend
#449
opened Oct 17, 2025 by
realAsma
Loading…
[5271050, 5274346][ONNX] Add support for Conv-Act-Pool fusion
#448
opened Oct 17, 2025 by
gcunhase
Loading…
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.