ENH: Remove quantization limits for Apple METAL device when running model via llama-cpp-python
#1134
Merged
aresnow1 merged 1 commit intoxorbitsai:mainfrom ChengjieLi28:enh/support_more_gguf_quan_on_metalMar 13, 2024
+1-5
Commits
Commits on Mar 12, 2024
- committed