
extend matmul_int8 and support matmul_with_flatten_int8 #56827

Merged: 15 commits into PaddlePaddle:develop on Sep 14, 2023

Conversation

@YSF-A (Contributor) commented on Aug 31, 2023

PR types

New features

PR changes

Ops

Description

  • Support transpose, broadcast, and related features for matmul_int8 (a minimal reference sketch of these semantics follows below)
  • Add matmul_with_flatten_int8
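As a rough illustration of the semantics being extended, below is a minimal standalone C++ reference sketch of an INT8 matmul with transpose flags and simple batch broadcasting, accumulating in INT32. It is illustrative only, not the PR's CUDA kernel; all function names, shapes, and values here are assumptions.

// Illustrative reference only (not the PR's kernel code): INT8 matmul with
// transpose flags and simple batch broadcasting, accumulating in INT32.
#include <cstdint>
#include <cstdio>
#include <vector>

// Computes C[b] = op(A[b or 0]) x op(B[b or 0]); a batch size of 1 on either
// side broadcasts against the other operand's batch dimension.
std::vector<int32_t> MatmulInt8(const std::vector<int8_t>& A,
                                const std::vector<int8_t>& B,
                                int batch_a, int batch_b, int M, int K, int N,
                                bool trans_a, bool trans_b) {
  const int batch = batch_a > batch_b ? batch_a : batch_b;
  std::vector<int32_t> C(batch * M * N, 0);
  for (int b = 0; b < batch; ++b) {
    const int8_t* a = A.data() + (batch_a == 1 ? 0 : b) * M * K;
    const int8_t* bp = B.data() + (batch_b == 1 ? 0 : b) * K * N;
    int32_t* c = C.data() + b * M * N;
    for (int m = 0; m < M; ++m) {
      for (int n = 0; n < N; ++n) {
        int32_t acc = 0;
        for (int k = 0; k < K; ++k) {
          // trans_a means A is stored (K, M); trans_b means B is stored (N, K).
          const int8_t av = trans_a ? a[k * M + m] : a[m * K + k];
          const int8_t bv = trans_b ? bp[n * K + k] : bp[k * N + n];
          acc += static_cast<int32_t>(av) * static_cast<int32_t>(bv);
        }
        c[m * N + n] = acc;
      }
    }
  }
  return C;
}

int main() {
  // A: 1 x 2 x 3 (batch of 1, broadcast), B: 2 x 3 x 2  ->  C: 2 x 2 x 2.
  const std::vector<int8_t> A = {1, 2, 3, 4, 5, 6};
  const std::vector<int8_t> B = {1, 0, 0, 1, 1, 1, 2, 0, 0, 2, 2, 2};
  const auto C = MatmulInt8(A, B, /*batch_a=*/1, /*batch_b=*/2,
                            /*M=*/2, /*K=*/3, /*N=*/2,
                            /*trans_a=*/false, /*trans_b=*/false);
  for (const int32_t v : C) std::printf("%d ", v);  // 4 5 10 11 8 10 20 22
  std::printf("\n");
  return 0;
}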

@paddle-bot bot commented on Aug 31, 2023

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. For details, see the Paddle CI Manual.

@paddle-bot bot added the contributor (External developers) label on Aug 31, 2023
@CLAassistant commented on Aug 31, 2023

CLA assistant check
All committers have signed the CLA.

@paddle-bot bot commented on Aug 31, 2023

✅ This PR's description meets the template requirements!
Please wait for other CI results.

@@ -43,3 +43,9 @@ PD_REGISTER_KERNEL(matmul_with_flatten,
double,
phi::dtype::bfloat16,
phi::dtype::float16) {}

Contributor

Rather than keeping matmul_int8 and matmul_with_flatten_int8 as kernels independent of matmul and matmul_with_flatten, we would prefer to support int8 by adding the data type to the existing kernels. The earlier matmul_int8 was a very temporary approach.

Contributor Author

This part has now been revised and implemented in the way described above.
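To make the direction discussed above concrete, the following is a hypothetical sketch of what adding the int8 data type to the existing registration could look like, mirroring the hunk quoted earlier. It is not the PR's actual diff; the backend, layout, kernel functor, and exact dtype list shown here are assumptions.

// Hypothetical sketch, not the actual diff in this PR: extend the existing
// phi registration with int8_t instead of registering a separate *_int8 kernel.
// Backend, layout, functor name, and dtype ordering are illustrative.
PD_REGISTER_KERNEL(matmul_with_flatten,
                   GPU,
                   ALL_LAYOUT,
                   phi::MatmulWithFlattenKernel,
                   float,
                   double,
                   int8_t,
                   phi::dtype::bfloat16,
                   phi::dtype::float16) {}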

@chenwhql (Contributor) left a comment

LGTM overall


#if defined(PADDLE_WITH_CUDA)
#if CUDA_VERSION >= 11060
REGISTER_OP_CUDA_KERNEL(
Contributor

Isn't this op already deprecated? Does its implementation still need to be updated?

Contributor

After dynamic-to-static (dy2static) conversion, execution still goes through this implementation.

@Ligoml (Contributor) left a comment

LGTM for docs

@qili93 (Contributor) left a comment

LGTM for unittest.skip

@phlrain (Collaborator) left a comment

LGTM for check_dygraph = False

@XiaoguangHu01 (Contributor) left a comment

LGTM

@carryyu merged commit 5b6594c into PaddlePaddle:develop on Sep 14, 2023
risemeup1 added a commit that referenced this pull request Sep 14, 2023
danleifeng pushed a commit to danleifeng/Paddle that referenced this pull request Nov 14, 2023
…#56827)

* add matmul_int8 and matmul_with_flatten_int8

* fix api of int8 matmul and matmul_with_flatten

* support dy2static

* fix static matmul build

* code refine

* add cuda version limitation in matmulflatten

* avoid rocm build error

* fix build error

* fix build errors: auto_tune and blaslt_impl

* fix inference ci error

* fix multidefinition error

* fix matmul with flatten error

* fix unit test

---------

Co-authored-by: RichardWooSJTU <37864677+RichardWooSJTU@users.noreply.github.com>
Co-authored-by: wufeisheng <wfs1997@163.com>