Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Quantized KV cache: update quanto (huggingface#31052)
* quanto latest version was refactored * add error msg * incorrect compare sign * Update src/transformers/cache_utils.py Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com> --------- Co-authored-by: amyeroberts <22614925+amyeroberts@users.noreply.github.com>
- Loading branch information