Replies: 1 comment 2 replies
-
I could be wrong but if it uses MMQ, isn't that automatically using the specialized instructions as soon as you have cuda 12.8 installed ? |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Will llama.cpp take advantage of FP4 compute of the new chips? E.g. Blackwell GPUs.
Beta Was this translation helpful? Give feedback.
All reactions