Skip to content

Commit 6273fe8

Browse files
authored
[Benchmarks] Fix imports in FP8 tuning script (#26407)
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
1 parent 9fb3ae4 commit 6273fe8

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

benchmarks/kernels/benchmark_w8a8_block_fp8.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -14,7 +14,7 @@
1414
from tqdm import tqdm
1515

1616
from vllm.model_executor.layers.quantization.utils.fp8_utils import (
17-
_w8a8_block_fp8_matmul,
17+
_w8a8_triton_block_scaled_mm,
1818
)
1919
from vllm.platforms import current_platform
2020
from vllm.triton_utils import triton
@@ -83,7 +83,7 @@ def grid(META):
8383
)
8484

8585
if A.dtype == torch.float8_e4m3fn:
86-
kernel = _w8a8_block_fp8_matmul
86+
kernel = _w8a8_triton_block_scaled_mm
8787
else:
8888
raise RuntimeError("Currently, only support tune w8a8 block fp8 kernel.")
8989

0 commit comments

Comments
 (0)