Llama3.1 and kv_cache quantization #738

Merged
merged 4 commits into from
Aug 27, 2024

fix benchmarks

c5e4dcb