[CI] Add smoke test for non-uniform AutoFP8 quantization (vllm-project#6702)

Signed-off-by: Alvant <alvasian@yandex.ru>
mgoin authored and Alvant committed Oct 26, 2024
1 parent fc1dca2 commit 5e9e9cf
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions tests/quantization/test_fp8.py
@@ -13,6 +13,7 @@
 MODELS = [
     "neuralmagic/Meta-Llama-3-8B-Instruct-FP8-KV",
     "nm-testing/Phi-3-mini-128k-instruct-FP8",
+    "nm-testing/Qwen2-0.5B-Instruct-FP8-SkipQKV",
 ]
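
The diff only shows the MODELS list, not the test body that consumes it. As a rough illustration of how such a list can drive a smoke test, the sketch below runs one short generation per checkpoint and records whether any text came back. The names `run_smoke_tests`, `generate`, and `fake_generate` are hypothetical and are not taken from tests/quantization/test_fp8.py; the stub generator stands in for real model loading so the example runs without a GPU.

```python
from typing import Callable, Dict, List

# Checkpoint names copied from the diff above.
MODELS: List[str] = [
    "neuralmagic/Meta-Llama-3-8B-Instruct-FP8-KV",
    "nm-testing/Phi-3-mini-128k-instruct-FP8",
    "nm-testing/Qwen2-0.5B-Instruct-FP8-SkipQKV",
]


def run_smoke_tests(
    models: List[str],
    generate: Callable[[str, str], str],
    prompt: str = "Hello my name is",
) -> Dict[str, bool]:
    """Run one short generation per model and record whether it produced text.

    `generate(model_id, prompt)` is a hypothetical hook (an assumption of this
    sketch); a real test would load the checkpoint with an inference engine.
    """
    results: Dict[str, bool] = {}
    for model_id in models:
        try:
            output = generate(model_id, prompt)
            # Passing means the model returned some non-whitespace text.
            results[model_id] = bool(output.strip())
        except Exception:
            # A load or generation failure counts as a failed smoke test.
            results[model_id] = False
    return results


def fake_generate(model_id: str, prompt: str) -> str:
    """Stand-in generator so the sketch runs without downloading any model."""
    return f"{prompt} {model_id.split('/')[-1]}"


if __name__ == "__main__":
    for model_id, ok in run_smoke_tests(MODELS, fake_generate).items():
        print(f"{'PASS' if ok else 'FAIL'}  {model_id}")
```

With this shape, adding a checkpoint to the list (as the commit does) is enough to extend coverage; no per-model test code is needed.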

