Skip to content

Commit ecbe466

Browse files
authored
Retire the ggml_mul_mat() branch for transposed src0 (#500)
* Retire the ggml_mul_mat() for transposed src0 - It can always be made contiguous with ggml_cpy() - The code is now simplified - The results are deterministic in respect to num threads * SIMD-ify dequantize_row_q4_0() for ARM_NEON (#502) * Attempt to SIMD-ify dequantize_row_q4_0() for ARM_NEON * Fix dequantization - forgot to interleave the quants
1 parent 502a400 commit ecbe466

File tree

1 file changed

+237
-716
lines changed

1 file changed

+237
-716
lines changed

0 commit comments

Comments
 (0)