
[RELAY][OP] Dynamic conv2d batch size for cuda #6598

Merged: 1 commit into apache:master on Oct 1, 2020

Conversation

@zhiics (Member) commented Sep 30, 2020

This PR enables conv2d with a dynamic batch size for CUDA.

CC @kevinthesun @icemelon9 @mbrookhart @comaniac
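For readers who want to see what a dynamic-batch conv2d means in practice, here is a minimal sketch (mine, not code from the PR) that builds a conv2d whose batch dimension is symbolic via relay.Any(); the shapes and channel counts are arbitrary example values:

import tvm
from tvm import relay

# relay.Any() marks the batch dimension as unknown until runtime.
data = relay.var("data", shape=(relay.Any(), 3, 224, 224), dtype="float32")
weight = relay.var("weight", shape=(16, 3, 3, 3), dtype="float32")
out = relay.nn.conv2d(data, weight, kernel_size=(3, 3), channels=16, padding=(1, 1))
mod = tvm.IRModule.from_expr(relay.Function([data, weight], out))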

python/tvm/topi/cuda/conv2d_nhwc_winograd.py

@@ -432,7 +441,8 @@ def nhwc_winograd_cuda(
         name="output",
         tag="conv2d_nhwc_winograd",
     )
-    cfg.add_flop(2 * N * CO * H * W * CI * KH * KW)
+    if isinstance(N, int):
+        cfg.add_flop(2 * N * CO * H * W * CI * KH * KW)
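For context on the guard (an illustration of mine, not part of the PR): with a dynamic batch, the N extracted from the input shape is a symbolic tvm.tir.Any rather than a Python int, so the FLOP product cannot be evaluated to a number. Assuming get_const_tuple keeps its behavior of converting static dims to ints while passing symbolic dims through unchanged (its current home is tvm.topi.utils; older releases used tvm.topi.util):

from tvm import te, tir
from tvm.topi.utils import get_const_tuple

# Static dims come back as Python ints; the symbolic batch stays tir.Any.
data = te.placeholder((tir.Any(), 56, 56, 64), name="data")
N, H, W, CI = get_const_tuple(data.shape)
print(isinstance(N, int), isinstance(H, int))  # False True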
@zhiics (Member, Author) commented:
@kevinthesun @icemelon9 @comaniac is this okay for AutoTVM?

A Contributor replied:

It's okay in terms of the functionality, but the output message would be weird. Since the AutoTVM progress bar shows throughput instead of latency, users will always see 0 GFLOPS during the tuning process (https://github.com/apache/incubator-tvm/blob/master/python/tvm/autotvm/tuner/callback.py#L159).

Maybe we can still have the FLOPS with N=1 and pop a message saying we are tuning the kernel with N=1 but it can be used by the kernel with any batch size?
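A hedged sketch of that suggestion (my wording; the warning text is invented, and this is not what the PR merged), dropping into the same spot as the hunk above:

import logging

if isinstance(N, int):
    cfg.add_flop(2 * N * CO * H * W * CI * KH * KW)
else:
    # Hypothetical fallback: report FLOPS as if N were 1 and warn that
    # the tuned kernel will be reused for every batch size.
    logging.warning(
        "Batch size is dynamic; tuning FLOPS are reported for N=1, "
        "but the tuned kernel can serve any batch size."
    )
    cfg.add_flop(2 * 1 * CO * H * W * CI * KH * KW)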

@zhiics (Member, Author) replied Sep 30, 2020:

Yeah, I thought about 1 as well, but the actual batch size may not be 1.

@kevinthesun (Contributor) replied Oct 1, 2020:

I think it's fine, since AutoTVM generally can't be used for dynamic-shape ops. Users won't see any FLOPS info when N is symbolic.
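The practical upshot, tuning statistics aside, is that a single compiled kernel serves every batch size at runtime. A hedged sketch of exercising that, reusing mod from the earlier example; dynamic shapes go through the Relay VM executor (the graph executor requires static shapes), and the device/target arguments follow the current create_executor signature:

import numpy as np
import tvm
from tvm import relay

ex = relay.create_executor("vm", mod=mod, device=tvm.cuda(0), target="cuda")
run = ex.evaluate()
for n in (1, 4, 8):
    x = np.random.rand(n, 3, 224, 224).astype("float32")
    w = np.random.rand(16, 3, 3, 3).astype("float32")
    y = run(x, w)  # same compiled module, different batch sizes
    print(n, y.shape)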


A similar review thread on python/tvm/topi/cuda/conv2d_winograd.py was resolved.
@comaniac changed the title from "[RELAY][OP] Dynamic conv2d for cuda" to "[RELAY][OP] Dynamic conv2d batch size for cuda" on Sep 30, 2020
@kevinthesun (Contributor) left a review comment:

LGTM

@comaniac comaniac merged commit e78aa61 into apache:master Oct 1, 2020
@comaniac (Contributor) commented Oct 1, 2020

Thanks @zhiics @kevinthesun

TusharKanekiDey pushed commits referencing this pull request to TusharKanekiDey/tvm on Oct 13, 14, 15, and 16, 2020
@zhiics zhiics deleted the dynamic_conv2d_cuda branch October 17, 2020 00:17
trevor-m pushed a commit to neo-ai/tvm that referenced this pull request Oct 19, 2020