lgeiger (Contributor) commented Oct 8, 2025

_w8a8_block_fp8_matmul seems to have been renamed to _w8a8_triton_block_scaled_mm in #25696. /cc @ElizaWszola

This PR adjusts the import in the FP8 tuning script accordingly.
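
For scripts that have to run against checkouts from both before and after the rename, one alternative is to look the symbol up under either name. The following is only a sketch and is not what this PR does (the PR simply updates the import). The helper `resolve_renamed` and the stand-in module `fake_fp8_utils` are hypothetical names introduced for illustration; only the two kernel names come from the description above.

```python
import types


def resolve_renamed(module, candidates):
    """Return (name, attribute) for the first candidate name found on module."""
    for name in candidates:
        attr = getattr(module, name, None)
        if attr is not None:
            return name, attr
    raise ImportError(f"none of {candidates} found in {module.__name__}")


# Hypothetical stand-in for the real module, so the sketch runs without vLLM.
fake_fp8_utils = types.ModuleType("fake_fp8_utils")
fake_fp8_utils._w8a8_triton_block_scaled_mm = lambda *args, **kwargs: "new kernel"

# Prefer the new name, fall back to the old one.
name, kernel = resolve_renamed(
    fake_fp8_utils,
    ["_w8a8_triton_block_scaled_mm", "_w8a8_block_fp8_matmul"],
)
print(name)
```

Listing the new name first means the fallback is only exercised on older checkouts, and a missing symbol fails with an explicit `ImportError` instead of a later `AttributeError`.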

Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>

mergify bot added the "performance" (Performance-related issues) label Oct 8, 2025

gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request aims to fix an import in a benchmark script due to a function rename. However, the new function name appears to be incorrect, which will cause the benchmark to fail. I've provided suggestions to use the correct Python wrapper function instead of the raw Triton kernel.

lgeiger (Contributor, Author) commented Oct 8, 2025

@DarkLight1337 Do you mind reviewing this PR as well?

@DarkLight1337 DarkLight1337 enabled auto-merge (squash) October 8, 2025 14:21
github-actions bot added the "ready" (ONLY add when PR is ready to merge/full CI is needed) label Oct 8, 2025
@DarkLight1337 DarkLight1337 merged commit 6273fe8 into vllm-project:main Oct 8, 2025
22 checks passed
mrasquinha-g pushed a commit to mrasquinha-g/vllm that referenced this pull request Oct 9, 2025
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
zhiyuan1i pushed a commit to zhiyuan1i/vllm that referenced this pull request Oct 9, 2025
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 10, 2025
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
Dhruvilbhatt pushed a commit to Dhruvilbhatt/vllm that referenced this pull request Oct 14, 2025
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
Signed-off-by: Dhruvil Bhatt <bhattdbh@amazon.com>
lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
alhridoy pushed a commit to alhridoy/vllm that referenced this pull request Oct 24, 2025
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Oct 24, 2025
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>
0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request Oct 26, 2025
Signed-off-by: Lukas Geiger <lukas.geiger94@gmail.com>
Signed-off-by: 0xrushi <6279035+0xrushi@users.noreply.github.com>