Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[Core] Default to using per_token quantization for fp8 when cutlass i…
…s supported. (vllm-project#8651) Signed-off-by: mgoin <michael@neuralmagic.com> Co-authored-by: Michael Goin <mgoin@redhat.com> Co-authored-by: mgoin <michael@neuralmagic.com>
- Loading branch information