@MasterJH5574
Contributor

This PR introduces blockwise-scale matmul and batch matmul CUTLASS kernels, adapted from SGLang (http://github.com/sgl-project/sglang), vLLM (https://github.com/vllm-project/vllm), and https://github.com/soundOfDestiny/cutlass.

We add unit tests for GEMM and BMM. This PR also restores some CUTLASS GEMM tests that were removed earlier while Relay was being phased out.
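
For readers unfamiliar with the term, here is a rough sketch of what a blockwise-scale matmul computes. It is a plain-Python reference, not the CUTLASS kernel from this PR. The scale layout is an assumption, since the description does not spell it out: one activation scale per row per K-block, and one weight scale per (N-block, K-block) tile. The function name and the `bn`/`bk` parameters are hypothetical.

```python
def blockwise_scale_matmul(a_q, a_scale, w_q, w_scale, bn, bk):
    """Reference (not the PR's kernel): out = dequant(a_q) @ dequant(w_q).T

    a_q:     M x K quantized activations (nested lists of numbers)
    a_scale: M x ceil(K / bk), assumed one scale per row per K-block
    w_q:     N x K quantized weights
    w_scale: ceil(N / bn) x ceil(K / bk), assumed one scale per weight tile
    """
    m, k, n = len(a_q), len(a_q[0]), len(w_q)
    out = [[0.0] * n for _ in range(m)]
    for i in range(m):
        for j in range(n):
            acc = 0.0
            for k0 in range(0, k, bk):
                kb = k0 // bk
                k1 = min(k0 + bk, k)
                # Accumulate the raw quantized products inside one K-block...
                partial = sum(a_q[i][t] * w_q[j][t] for t in range(k0, k1))
                # ...then apply both scales once per block, not once per element.
                acc += partial * a_scale[i][kb] * w_scale[j // bn][kb]
            out[i][j] = acc
    return out
```

Scaling the per-block partial sum is mathematically equivalent to dequantizing both operands first and then multiplying, which is how a test could check such a kernel against a straightforward reference.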

@MasterJH5574 MasterJH5574 force-pushed the tvm-dev/2025-03-29-cutlass-blockwise-gemm-bmm branch from fa7eeb8 to b25a365 Compare March 29, 2025 17:43
@MasterJH5574 MasterJH5574 force-pushed the tvm-dev/2025-03-29-cutlass-blockwise-gemm-bmm branch from b25a365 to 5219852 Compare March 29, 2025 18:10
@yongwww yongwww merged commit b0ccfb3 into apache:main Mar 31, 2025
15 checks passed
ShiboXing pushed a commit to ShiboXing/tvm that referenced this pull request Aug 10, 2025