Skip to content

Q8_0: unbreak AVX #1117

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
wants to merge 1 commit into from
Closed

Q8_0: unbreak AVX #1117

wants to merge 1 commit into from

Conversation

sw
Copy link
Contributor

@sw sw commented Apr 22, 2023

#1109 was not finished for AVX (note: that affects all quantized formats, not just Q4_3 as the summary would suggest). This fixes it by introducing hsum_i32_4, in order to calculate s0 and s1.

@sw sw closed this Apr 22, 2023
@sw sw deleted the q8-avx branch April 22, 2023 08:11
@ggerganov
Copy link
Member

I added commented flags to the Makefile that can be used to go in AVX-only mode for easier debugging in the future:

https://github.com/ggerganov/llama.cpp/blob/master/Makefile#L79-L83

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants