[TensorRT] Add transpose_a/b for TensorRT batch_matmul #8607
Conversation
Thanks @ymwangg! Left some minor comments in review.
x = relay.var("x", shape=(x_shape), dtype="float32") | ||
y = relay.var("y", shape=(y_shape), dtype="float32") | ||
out = relay.nn.batch_matmul(x, y) | ||
out = relay.nn.batch_matmul( | ||
relay.transpose(x, [0, 2, 1]) if transa else x, |
I don't think you need these relay.transpose on the inputs to test the functionality of the transa/transb args.
Good point, I've changed it to use x/y_shape instead.
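For illustration, a minimal sketch of the shape-based approach (the helper name and example shapes are my own, not the PR's actual test code): the transposed layout is carried by the input shapes themselves, so no relay.transpose op is needed in the graph.

from tvm import relay

def make_batch_matmul(x_shape, y_shape, transa, transb):
    # When transa/transb is set, the shape already encodes the transposed
    # layout, so the graph contains only the batch_matmul op itself.
    x = relay.var("x", shape=x_shape, dtype="float32")
    y = relay.var("y", shape=y_shape, dtype="float32")
    return relay.nn.batch_matmul(x, y, transpose_a=transa, transpose_b=transb)

# (B, K, M) x (B, K, N) with transpose_a=True yields a (B, M, N) result.
out = make_batch_matmul((12, 64, 32), (12, 64, 96), True, False)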
LGTM. I'll leave it to @trevor-m to merge after the comments are addressed.
python/tvm/relay/frontend/onnx.py (outdated diff)
- # Transpose matrix dimensions of b.
- b = _op.transpose(b, [0, 2, 1])
  # Perform a batch matmul.
- output = _op.nn.batch_matmul(a, b)
+ output = _op.nn.batch_matmul(a, b, transpose_b=False)
Thanks @ymwangg!
Just a little concern about changing the default behavior of a framework frontend: currently the default topi schedule support for the NN format is not as strong as for the original NT one.
This may confuse people who have used the onnx frontend before or who are using it now.
To give an example, I added an extra config to the TensorFlow frontend that uses the NT format by default but provides an option to use the normal (NN) format. I think that would be better until we have prepared a strong enough topi schedule.
P.S.: Note that I've also kept the default layout for nn.batch_matmul as the original NT.
tvm/python/tvm/relay/frontend/tensorflow_ops.py, lines 1191 to 1199 in 7653972:
if TF_DEFAULT_CONFIGS["use_nt_batch_matmul"]:
    # Strictly convert all batch_matmul to NT format
    input_x = _op.transpose(input_x, axes=[0, 2, 1]) if adj_x else input_x
    input_y = _op.transpose(input_y, axes=[0, 2, 1]) if not adj_y else input_y
    ret = get_relay_op("batch_matmul")(input_x, input_y)
else:
    ret = get_relay_op("batch_matmul")(
        input_x, input_y, transpose_a=adj_x, transpose_b=adj_y
    )
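For reference, a small sketch of what NT vs. NN means at the Relay level (the shapes are example values of my own):

from tvm import relay

a = relay.var("a", shape=(4, 16, 32), dtype="float32")       # (B, M, K)
b_nt = relay.var("b_nt", shape=(4, 8, 32), dtype="float32")  # (B, N, K)
b_nn = relay.var("b_nn", shape=(4, 32, 8), dtype="float32")  # (B, K, N)

# NT (the historical default, transpose_b=True): out = a @ b_nt^T
out_nt = relay.nn.batch_matmul(a, b_nt)
# NN: out = a @ b_nn, with b given in its natural layout
out_nn = relay.nn.batch_matmul(a, b_nn, transpose_b=False)
# Both produce a (4, 16, 8) result.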
@jcf94 Thanks for the pointer. I will refactor to make NN optional.
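The resulting refactor (per the commit list below) mirrors the TensorFlow switch. A hedged sketch of the shape it could take: the ONNX_DEFAULT_CONFIGS name comes from the PR's commits, but the key name and the exact code here are my assumptions.

from tvm.relay import op as _op

# Assumed config dict, analogous to TF_DEFAULT_CONFIGS above.
ONNX_DEFAULT_CONFIGS = {"use_nt_batch_matmul": True}

def convert_matmul(a, b):
    if ONNX_DEFAULT_CONFIGS["use_nt_batch_matmul"]:
        # Keep the historical NT behavior: transpose b, use the default layout.
        b = _op.transpose(b, [0, 2, 1])
        return _op.nn.batch_matmul(a, b)
    # Otherwise feed a and b directly in NN format.
    return _op.nn.batch_matmul(a, b, transpose_b=False)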
LGTM. Thanks! @ymwangg
* Add transpose support for tensorrt batch_matmul
* Address PR comment
* Refactor to add ONNX_DEFAULT_CONFIGS
This PR adds transpose_a/b for TensorRT batch_matmul and fixes a warning and a compilation error with TensorRT-8. It also removes the redundant transpose op in the onnx matmul converter. Tested with both TensorRT-7 and TensorRT-8.
cc @trevor-m @comaniac
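On the TensorRT side, the conversion amounts to forwarding the new attributes to TensorRT's matrix-multiply layer. A rough sketch of the idea using the TensorRT Python API (the actual converter in the PR lives in TVM's C++ BYOC codegen; the function and variable names here are illustrative):

import tensorrt as trt

def add_batch_matmul(network, a, b, transpose_a, transpose_b):
    # Relay's transpose_a/b attributes map directly onto TensorRT's
    # MatrixOperation flags, so no explicit transpose layer is needed.
    op_a = trt.MatrixOperation.TRANSPOSE if transpose_a else trt.MatrixOperation.NONE
    op_b = trt.MatrixOperation.TRANSPOSE if transpose_b else trt.MatrixOperation.NONE
    return network.add_matrix_multiply(a, op_a, b, op_b)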