Conv2d_transpose kernel 2x2, strides (2,2) fails for CUDA - Cannot prove #4470

apivovarov · 2019-12-05T02:09:41Z

I found that conv2d_transpose op fails when kernel size is 2x2 and strides are (2,2).
Errors

tvm/src/pass/loop_partition.cc:544: Cannot prove: ((((floordiv((((40 - (floordiv((dh + 1), 2)*8))*(5 - floordiv((dw + 1), 2))) + 63), 64) - 1) - (((40 - (floordiv((dh + 1), 2)*8))*(5 - floordiv((dw + 1), 2))) - select((-63 <= ((40 - (floordiv((dh + 1), 2)*8))*(5 - floordiv((dw + 1), 2)))), (floordiv((((40 - (floordiv((dh + 1), 2)*8))*(5 - floordiv((dw + 1), 2))) + 63), 64)*63), 0))) + 1) >= 0), when generating the post doubt loop

  File "/root/workplace/tvm/src/pass/split_host_device.cc", line 135
TVMError: Check failed: !use_count_.count(v): variable dh has been used before definition!
During handling of the above exception, another exception occurred:

To reproduce the error

import tvm
from tvm import relay
import tensorflow as tf
input_tensor = "input_1"
# NHWC
input_shape=(1,16,16,8)
x = tf.compat.v1.placeholder(tf.float32, shape=input_shape, name=input_tensor)
# HWOI
w2 = tf.compat.v1.placeholder(tf.float32, shape=(2,2,3,8))
out_shape = tf.compat.v1.placeholder(tf.int32, shape=(4))
deconv = tf.compat.v1.nn.conv2d_transpose(x, w2, out_shape, (1,2,2,1), padding='VALID')

sess = tf.compat.v1.Session()
graph_def = sess.graph_def

mod, params = relay.frontend.from_tensorflow(graph_def, layout='NCHW', shape={input_tensor: input_shape})

target = tvm.target.cuda()
from tvm.autotvm.measure.measure_methods import set_cuda_target_arch
set_cuda_target_arch('sm_70')

with relay.build_config(opt_level=3):
    graph, lib, params = relay.build(mod, target, params=params)

I tried other targets - llvm and arm_cpu - they are working fine. Only cuda fails.

Related to PR 4243

Discussions:
https://discuss.tvm.ai/t/conv2d-transpose-kernel-2x2-strides-2-2-fails-for-cuda-cannot-prove/5020

https://discuss.tvm.ai/t/compile-error-for-cuda-target/4164

The text was updated successfully, but these errors were encountered:

apivovarov · 2019-12-06T03:13:14Z

Workaround PR #4472 @vinx13 @tqchen @merrymercy @Huyuwei @optima2005

apivovarov mentioned this issue Dec 6, 2019

Workaround to make conv2d_transpose compilation for CUDA work #4472

Merged

optima2005 mentioned this issue Dec 11, 2019

[FRONTEND][TF] conv2d_transpose 'SAME' support kernel more than 1x1 #4484

Merged

tqchen closed this as completed Dec 19, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Conv2d_transpose kernel 2x2, strides (2,2) fails for CUDA - Cannot prove #4470

Conv2d_transpose kernel 2x2, strides (2,2) fails for CUDA - Cannot prove #4470

apivovarov commented Dec 5, 2019 •

edited

Loading

apivovarov commented Dec 6, 2019 •

edited

Loading

Conv2d_transpose kernel 2x2, strides (2,2) fails for CUDA - Cannot prove #4470

Conv2d_transpose kernel 2x2, strides (2,2) fails for CUDA - Cannot prove #4470

Comments

apivovarov commented Dec 5, 2019 • edited Loading

apivovarov commented Dec 6, 2019 • edited Loading

apivovarov commented Dec 5, 2019 •

edited

Loading

apivovarov commented Dec 6, 2019 •

edited

Loading