-
Notifications
You must be signed in to change notification settings - Fork 36
Issues: ModelCloud/GPTQModel
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[BUG] 3bit quantization not working
bug
Something isn't working
#1207
opened Feb 3, 2025 by
sidhantls
[BUG] Bitblas Kernel not compatbile with bitblas > 0.0.1-dev13
bug
Something isn't working
#1192
opened Jan 31, 2025 by
Qubitium
[BUG] Quantized MoE model is generating invalid repsonse
bug
Something isn't working
#991
opened Jan 2, 2025 by
BodhiHu
Fix Qwen docs about gptq quantization quality + sharding + bad quants + autogptq
#817
opened Dec 11, 2024 by
Qubitium
[INTEGRATION] Expose stable kernel/packing/repacking apis
bug
Something isn't working
#726
opened Dec 3, 2024 by
wenhuach21
[FEATURE] Add Something isn't working
dynamic
suppor for AutoRound quantiztion
bug
#329
opened Aug 2, 2024 by
Qubitium
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.