
[relay][frontend] aten::copy_ support for pytorch #15502

Merged · 13 commits merged into apache:main · Oct 19, 2023

Conversation

jhlee525 (Contributor) commented Aug 7, 2023

Although #9375 was rejected, I tried a different way to support the aten::copy_ op.

aten::copy_ behaves differently from other in-place ops: it is in-place in a "pure" sense. Other in-place nodes still relay their output to downstream users in the output graph (torch.Graph), so a DAG can be constructed from them. aten::copy_, however, writes into its first input and returns it, and nothing in the graph consumes that output, so all of its mutations dangle.

For example, a torch module like

class Test(torch.nn.Module):
    def __init__(self) -> None:
        super().__init__()

    def forward(self, x: torch.Tensor):
        x[:5, :5] = x[:5, :5] + 1
        return x

generates the graph

graph(%self : __torch__.Test,
      %x : Float(10, 10, strides=[10, 1], requires_grad=0, device=cpu)):
  %4 : int = prim::Constant[value=0]() # /home/jhlee/tvm/test.py:10:0
  %5 : int = prim::Constant[value=0]() # /home/jhlee/tvm/test.py:10:0
  %6 : int = prim::Constant[value=5]() # /home/jhlee/tvm/test.py:10:0
  %7 : int = prim::Constant[value=1]() # /home/jhlee/tvm/test.py:10:0
  %8 : Float(5, 10, strides=[10, 1], requires_grad=0, device=cpu) = aten::slice(%x, %4, %5, %6, %7)
  %9 : int = prim::Constant[value=1]() # /home/jhlee/tvm/test.py:10:0
  %10 : int = prim::Constant[value=0]() # /home/jhlee/tvm/test.py:10:0
  %11 : int = prim::Constant[value=5]() # /home/jhlee/tvm/test.py:10:0
  %12 : int = prim::Constant[value=1]() # /home/jhlee/tvm/test.py:10:0
  %13 : Float(5, 5, strides=[10, 1], requires_grad=0, device=cpu) = aten::slice(%8, %9, %10, %11, %12)
  %14 : Long(requires_grad=0, device=cpu) = prim::Constant[value={1}]()
  %15 : int = prim::Constant[value=1]() # /home/jhlee/tvm/test.py:10:0
  %16 : Float(5, 5, strides=[5, 1], requires_grad=0, device=cpu) = aten::add(%13, %14, %15)
  %17 : int = prim::Constant[value=0]() # /home/jhlee/tvm/test.py:10:0
  %18 : int = prim::Constant[value=0]() # /home/jhlee/tvm/test.py:10:0
  %19 : int = prim::Constant[value=5]() # /home/jhlee/tvm/test.py:10:0
  %20 : int = prim::Constant[value=1]() # /home/jhlee/tvm/test.py:10:0
  %21 : Float(5, 10, strides=[10, 1], requires_grad=0, device=cpu) = aten::slice(%x, %17, %18, %19, %20)
  %22 : int = prim::Constant[value=1]() # /home/jhlee/tvm/test.py:10:0
  %23 : int = prim::Constant[value=0]() # /home/jhlee/tvm/test.py:10:0
  %24 : int = prim::Constant[value=5]() # /home/jhlee/tvm/test.py:10:0
  %25 : int = prim::Constant[value=1]() # /home/jhlee/tvm/test.py:10:0
  %26 : Float(5, 5, strides=[10, 1], requires_grad=0, device=cpu) = aten::slice(%21, %22, %23, %24, %25)
  %27 : bool = prim::Constant[value=0]()
  %28 : Float(5, 5, strides=[10, 1], requires_grad=0, device=cpu) = aten::copy_(%26, %16, %27)
  return (%x)

which returns %x itself. Note that %26 is a strided view into %x (produced by the two aten::slice ops at %21 and %26), so the aten::copy_ at %28 mutates %x, yet %28 is never consumed.

My approach to handling this problem is:

  1. In from_pytorch, after _run_jit_passes is called, insert a torch-level (torch.Graph) pass, _redirect_inplace_output, that redirects the output of aten::copy_ so the mutation becomes a visible edge in the DAG (see the sketch after this list).
  2. When converting an aten::copy_ node, walk up through its parents to collect the aten::select and aten::slice nodes, and generate the indices of the destination from them. I referenced the behavior of the torch -> ONNX conversion in the PyTorch repository for this.
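
To make both steps concrete, here is a minimal sketch of the idea (not the exact code in this PR; collect_slice_chain is a hypothetical helper name, and the sketch assumes torch._C exposes Value.replaceAllUsesAfterNodeWith, as its C++ IR does):

import torch

def redirect_inplace_output(graph: torch._C.Graph):
    # Step 1 (sketch): for every aten::copy_, rewire later uses of the
    # mutated destination to the copy_ output, so the write becomes an
    # edge in the DAG instead of a dangling node.
    for node in graph.nodes():
        if node.kind() != "aten::copy_":
            continue
        dst = list(node.inputs())[0]  # destination view being written into
        dst.replaceAllUsesAfterNodeWith(node, node.output())

def collect_slice_chain(copy_node):
    # Step 2 (sketch): walk the destination's producer chain upward and
    # gather the aten::slice / aten::select nodes that locate the region
    # of the base tensor that the copy_ writes into.
    chain = []
    producer = list(copy_node.inputs())[0].node()
    while producer.kind() in ("aten::slice", "aten::select"):
        chain.append(producer)
        producer = list(producer.inputs())[0].node()
    return list(reversed(chain))  # outermost (base-tensor) slice first

In the example graph above, collect_slice_chain applied to the copy_ at %28 would return the slice nodes at %21 and %26, from which the converter can recover that the write targets x[:5, :5].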

This is my first PR to this repository, so please let me know if you have any feedback or questions.

tvm-bot (Collaborator) commented Aug 7, 2023

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

Generated by tvm-bot

jhlee525 changed the title from "aten::copy_ support in pytorch frontend" to "[relay][frontend]aten::copy_ support in pytorch frontend" on Aug 7, 2023
jhlee525 changed the title from "[relay][frontend]aten::copy_ support in pytorch frontend" to "[relay][frontend] aten::copy_ support for pytorch" on Aug 7, 2023
jhlee525 (Contributor, Author) commented:

  1. Can this PR be merged to main, with respect to the design of the Relay architecture?
  2. The test fails on the CI server, but it is hard to tell why. Locally, my test case passes without any failure. I tried to figure out what happened on the CI server, but the log doesn't show enough detail; it is probably a segmentation fault. Could anybody give me advice on how to debug this?

rebel-jhlee (Contributor) commented:

@masahi

masahi (Member) commented Aug 17, 2023

You've got a segfault from your test.

rebel-jhlee (Contributor) commented:

@masahi It's ready for review. Sorry for the late response.

@@ -4470,6 +4558,26 @@ def _run_jit_passes(graph, enable_lower_all_tuples=True):
torch._C._jit_pass_lower_all_tuples(graph)


def _redirect_inplace_output(graph):
Review comment (Member):

Please give an example of what this pass does, by documenting IR before / after this pass.

Reply (Contributor Author):

OK, an example has been added.

return x

inputs = torch.randn(10, 10)
verify_model(InplaceCopy(), [inputs])
Review comment (Member):

Please add more tests, using various tricky examples to make sure that the conversion works.

Reply (Contributor Author):

I added more tests, trying to cover various cases. Please let me know if you have any suggestions.
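
For reference, the tricky cases worth covering look like the following (hand-written illustrations in the style of the test above, using the same verify_model helper; the exact tests added in this PR may differ):

class InplaceCopyRow(torch.nn.Module):
    def forward(self, x):
        x[0] = x[1] * 2  # aten::select followed by aten::copy_
        return x

class InplaceCopyChained(torch.nn.Module):
    def forward(self, x):
        x[2:8][1:3] = x[:2] + 1  # chained views before the copy_
        return x

verify_model(InplaceCopyRow(), [torch.randn(10, 10)])
verify_model(InplaceCopyChained(), [torch.randn(10, 10)])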

Reply (Member):

thanks

jhlee525 requested a review from masahi on October 18, 2023
masahi (Member) left a review comment:

Ok, let's try this approach for our first copy_ support.

masahi merged commit b7aada1 into apache:main on Oct 19, 2023