Add torchao quant (int4/int8/fp8) to llama models #155

Annotations

1 warning

performance-test-1-gpu

succeeded Sep 9, 2024 in 13m 21s