Skip to content

b2524

Compare
Choose a tag to compare
@github-actions github-actions released this 25 Mar 07:07
7733f0c
ggml : support AVX512VNNI (#6280)

This change causes some quants (e.g. Q4_0, Q8_0) to go faster on some
architectures (e.g. AMD Zen 4).