[BYOC] Add GEMM kernel from FasterTransformer as submodule #15046
I extracted the fp16 activation × int8/int4 weight GEMM kernel from FasterTransformer (see NVIDIA/cutlass#911) to make it easier to build and integrate into TVM. The extracted code has been cleaned up in a repo under the tlc-pack organization, and it is being added here as a submodule. A follow-up PR will update the CUTLASS BYOC to support offloading to this kernel, which will be useful for weight-quantized LLM inference.
Please review the licensing etc. @tqchen @junrushao @vinx13 @sunggg