Skip to content

Commit daaec73

Browse files
Fix pre-commit issues
- Remove weight_dtype check (attribute doesn't exist in config) - Fix markdown trailing space - Error handling is done at kernel level instead Signed-off-by: padg9912 <phone.and.desktop@gmail.com>
1 parent 7bfb572 commit daaec73

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

docs/features/quantization/int8.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ Please visit the HF collection of [quantized INT8 checkpoints of popular LLMs re
99
INT8 computation is supported on NVIDIA GPUs with compute capability > 7.5 (Turing, Ampere, Ada Lovelace, Hopper).
1010

1111
!!! warning
12-
**Blackwell GPU Limitation**: INT8 is not supported on compute capability >= 100 (e.g., RTX 6000 Blackwell).
12+
**Blackwell GPU Limitation**: INT8 is not supported on compute capability >= 100 (e.g., RTX 6000 Blackwell).
1313
Use [FP8 quantization](fp8.md) instead, or run on Hopper/Ada/Ampere architectures.
1414

1515
## Prerequisites

0 commit comments

Comments
 (0)