Pull requests: vllm-project/llm-compressor
Pull requests list
wandb/tensorboard loggers set default init to False
ready
When a PR is ready for review
#1235
opened Mar 7, 2025 by
brian-dellabetta
[Quantization] Channel-wise Output Activation Quantization for Attention QKV Modules + KV-cache channel quantization
ready
fixing reproducibility of lmeval tests
ready
#1220
opened Mar 4, 2025 by
brian-dellabetta
[Train] Training Pipeline
ready
#1214
opened Feb 28, 2025 by
horheynm
[Callbacks][Docs] Add docstrings to saving functions
#1201
opened Feb 26, 2025 by
kylesayrs
Use KV cache constant names provided by compressed tensors
#1200
opened Feb 26, 2025 by
kylesayrs
Remove unused/duplicated/non-applicable utils from pytorch/utils/helpers
#1174
opened Feb 19, 2025 by
kylesayrs
[Callbacks] Remove EventLifecycle and on_start event
#1170
opened Feb 19, 2025 by
kylesayrs
[Callbacks] Remove double initialization, replace with updating the state directly
ready
#1169
opened Feb 19, 2025 by
kylesayrs
Offload Cache Support torch.dtype
ready
#1141
opened Feb 12, 2025 by
kylesayrs
Implement lazy loading for traceable models
ready
#1105
opened Jan 28, 2025 by
kylesayrs