support inference for quantized matmul_v2 #36594

XGZhang11 · 2021-10-20T15:17:38Z

PR types

New features

PR changes

Others

Describe

Adds support for running the quantized matmul_v2 op in TensorRT inference.

CLAassistant · 2021-10-20T15:17:45Z

All committers have signed the CLA.

paddle-bot-old · 2021-10-20T15:18:27Z

Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

qingqing01

need to add unit testing

wanghaoshuang · 2021-10-21T05:59:41Z

paddle/fluid/framework/ir/quant_conv2d_dequant_fuse_pass.cc

@@ -621,7 +642,8 @@ void QuantDequantFusePass::ApplyImpl(ir::Graph* graph) const {
  std::unordered_set<std::string> quant_types = {
      "fake_quantize_range_abs_max", "fake_quantize_moving_average_abs_max"};
  std::unordered_set<std::string> quantized_op_types = {
-      "conv2d", "mul", "matmul", "depthwise_conv2d", "fc", "conv2d_transpose"};
+      "conv2d",           "mul", "matmul",          "matmul_v2",


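Please clean up the extra spaces here.

The code style here requires filling the first line and then aligning the entries vertically, so the spaces cannot be removed, and the entries cannot each be put on their own line either.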
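For context, the diff above adds "matmul_v2" to the set of op types that the pass treats as quantized. Below is a minimal sketch of how such a set can gate op handling. `IsQuantizedOpType` is a hypothetical helper written for illustration and is not part of Paddle. The added line in the diff is truncated, so the sketch assumes the remaining entries are the same as on the removed line.

```cpp
#include <string>
#include <unordered_set>

// Hypothetical helper (not Paddle's API): returns true when `op_type` is one
// of the op types treated as quantized. The entries mirror the list in the
// diff, with "matmul_v2" added; the tail of the list is an assumption because
// the added line is truncated in the diff.
bool IsQuantizedOpType(const std::string& op_type) {
  static const std::unordered_set<std::string> quantized_op_types = {
      "conv2d", "mul", "matmul", "matmul_v2",
      "depthwise_conv2d", "fc", "conv2d_transpose"};
  return quantized_op_types.count(op_type) > 0;
}
```

With this sketch, `IsQuantizedOpType("matmul_v2")` returns true, while an op type that is not in the set, such as "relu", returns false.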

LiuChiachi

LGTM

Wangzheee · 2021-10-21T07:32:14Z

LGTM

XGZhang11 · 2021-10-21T09:07:39Z

Re "need to add unit testing": the inference team is planning to add BERT FP32 and INT8 inference to CI.

ceci3

LGTM

* support inference for quantized matmul_v2 * undate code style * code style

support inference for quantized matmul_v2

ce1e650

XGZhang11 force-pushed the infer_for_matmul_v2 branch from 4968505 to ce1e650 on October 20, 2021 15:39

qingqing01 requested review from wanghaoshuang, shangzhizhou, LiuChiachi, qingqing01 and ceci3 October 21, 2021 05:51

qingqing01 reviewed Oct 21, 2021

wanghaoshuang reviewed Oct 21, 2021

LiuChiachi previously approved these changes Oct 21, 2021

undate code style

0c74505

XGZhang11 dismissed LiuChiachi’s stale review via 0c74505 October 21, 2021 09:04

code style

b7fdf44

ceci3 approved these changes Oct 25, 2021

ceci3 merged commit b151a45 into PaddlePaddle:develop Oct 28, 2021

ghost pushed a commit to piotrekobi/Paddle that referenced this pull request Nov 3, 2021

support inference for quantized matmul_v2 (PaddlePaddle#36594)

47e2981

* support inference for quantized matmul_v2 * undate code style * code style
