Pull requests: NVIDIA/TensorRT-Model-Optimizer
Pull requests list
Bump the pip group across 4 directories with 2 updates
Labels: dependencies, python
#646
opened Dec 4, 2025 by
dependabot
bot
Optimize calibrate_draft_vocab to read only required lines when calib…
#618
opened Nov 27, 2025 by
Ofir408
Convert compressed-tensor int4 format to GPTQ int4 format
#590
opened Nov 20, 2025 by
Edwardf0t1
Product Rename: TensorRT Model Optimizer to Model Optimizer
#583
opened Nov 20, 2025 by
kevalmorabia97
2 tasks done
[OMNIML-2852] [2/n] Add Core Sparse Attention Infrastructure
#527
opened Nov 7, 2025 by
kaix-nv
[Draft] [5526696] Add kv cache quantization support for onnx quantization
#486
opened Oct 31, 2025 by
zhanghaoc
Preserve original rope scaling type in export due to transformers library AutoConfig issue
#452
opened Oct 17, 2025 by
Edwardf0t1
[1/2] Registry interface for custom quantization functional backend
#449
opened Oct 17, 2025 by
realAsma