[Torch] Support Python list, more realistic recurrent networks #5306
Conversation
LGTM. Is a TorchScript list immutable, or mutable like Python's list?
Yes, it is mutable. Even though variables that are updated inside a loop are supposed to be passed to the loop explicitly, TorchScript lets you append to a list declared outside the loop:

```python
import torch
from torch import nn, Tensor
from typing import List

class ListAppend(nn.Module):
    def forward(self, input):
        # type: (Tensor) -> List[Tensor]
        outputs = []
        for i in range(input.size(0)):
            outputs.append(input)
        return outputs
```
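For reference, a minimal check (assuming the class above) that TorchScript accepts this pattern:

```python
scripted = torch.jit.script(ListAppend())
out = scripted(torch.randn(3, 4))
assert len(out) == 3  # one element appended per row of the input
```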
To work around the difficulty of list append, I use list concat to append one element at the tail of a list. The original LSTM models in the PyTorch repo do not use list append either; they use concat instead, probably for the same reason.
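A minimal sketch of the concat idiom (`ListConcat` is a hypothetical name, not from this PR): instead of mutating the list in place, each iteration rebinds it to the concatenation of the old list and a one-element list, which maps naturally onto the prelude concat.

```python
import torch
from torch import nn, Tensor
from typing import List

class ListConcat(nn.Module):  # hypothetical illustration of the concat idiom
    def forward(self, input):
        # type: (Tensor) -> List[Tensor]
        outputs = torch.jit.annotate(List[Tensor], [])
        for i in range(input.size(0)):
            outputs = outputs + [input]  # concat instead of append
        return outputs
```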
From an outsider's perspective, it seems like the more principled approach is to translate a list to a Reference of List. We could then write passes to remove the Reference where possible.
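A rough sketch of what that Reference of List idea could look like when building Relay by hand, assuming the Prelude exposes its `nil`/`cons` constructors as attributes (true for the TVM version around this PR; the exact API may differ): a mutable list becomes a reference cell holding an immutable list, and an update becomes read-cons-write, which a later pass could then try to eliminate.

```python
import tvm
from tvm import relay
from tvm.relay.prelude import Prelude

mod = tvm.IRModule()
p = Prelude(mod)  # brings the List ADT (nil/cons) into the module

x = relay.var("x")       # element to add
r = relay.var("r")       # reference cell holding the current list
unused = relay.var("_")

# update as read-cons-write: r := cons(x, !r), then return the new list.
# cons prepends at the head for brevity; a faithful append would
# concat at the tail instead.
update_x = relay.Let(
    r, relay.RefCreate(p.nil()),
    relay.Let(
        unused, relay.RefWrite(r, p.cons(x, relay.RefRead(r))),
        relay.RefRead(r),
    ),
)
```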
Force-pushed from f1dedd4 to 8d56f33
Overall LGTM. Some minor comments regarding registering static tensor array ops.
Force-pushed from 8d56f33 to 3074c9a
@kevinthesun Thanks for the review! Please have a look at the last commit.
LGTM
Thanks @masahi @MarisaKirisame
…e#5306)

* use funcs from prelude, pass around convert_map
* get relay input type from user ishape
* handle tuple unpack
* experimenting with static tensor array
* use prelude concat instead of cons + rev
* minor clean up
* fix layer norm conversion bug, unwrap tensor array
* add infer shape on tensor array
* pass around prelude for now
* compile worked but runtime error
* fix tensor array wrapping
* begin list dynamic test
* is_list_dynamic first version
* finish dynamic list test
* a few fix
* use shape_of function if Any is found
* improve size conversion
* working on adding free vars to loop block
* fixed inlined inner loop issue
* clean up free var handling
* add support for tensor array concat
* adding ta concat on last axis
* fix concat, but got runtime error
* disable concat on axis -1 for now
* add lstm tests
* revert unrelated change
* fix stacked bidir test
* minor fix to test
* relax tol a bit, revert dnnl change to avoid conflict
* simplify infer type, use input tensor shape rather than concat shape
* more shape fix
This PR builds on the control flow support added in #4964 and aims to support more realistic recurrent networks than the simple one there. Specifically, the goal is to enable translating the LSTM models in the PyTorch repo (https://github.com/pytorch/pytorch/tree/master/benchmarks/fastrnns) described in their blog post.
Translating these models requires handling dynamic lists and dynamic tensor shapes. I added the necessary support to the Torch frontend using the Prelude List ADT, the static tensor array from #5103, and Any. Previously we could only translate "tensors in, tensors out" models, but now we can ingest more complex inputs such as a list of tuples of tensors.
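As a hedged usage sketch (the toy model and shapes below are illustrative, not from the PR's test suite), conversion goes through the usual frontend entry point:

```python
import torch
from torch import nn, Tensor
from typing import List
import tvm
from tvm import relay

class ToyRNN(nn.Module):  # hypothetical stand-in for the PR's LSTM tests
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(16, 16)

    def forward(self, input):
        # type: (Tensor) -> List[Tensor]
        outputs = torch.jit.annotate(List[Tensor], [])
        h = torch.zeros(1, 16)
        for i in range(input.size(0)):          # becomes a Relay while loop
            h = torch.tanh(self.linear(input[i]) + h)
            outputs = outputs + [h]             # list output, concat-based
        return outputs

model = torch.jit.script(ToyRNN())
inp = torch.randn(5, 1, 16)  # (seq_len, batch, feature)
mod, params = relay.frontend.from_pytorch(model, [("input", inp.shape)])
```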
See the new test cases for the kinds of models we can support now. I added several LSTM variants.
The result of translating a three-layer stacked bidirectional LSTM is dumped here. Even though this model has three nested loops, the two outer loops are unrolled by TorchScript, so the Relay IR dump contains 3 (layers) × 2 (directions) while loops.
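A hypothetical illustration (not the PR's test code) of why the outer loops disappear: TorchScript unrolls iteration over an `nn.ModuleList`, since each layer is a distinct submodule, so only the loop inside each layer survives as a real loop in the IR.

```python
import torch
from torch import nn, Tensor

class StackedNet(nn.Module):  # hypothetical example
    def __init__(self, num_layers):
        super().__init__()
        self.layers = nn.ModuleList(nn.Linear(16, 16) for _ in range(num_layers))

    def forward(self, x):
        # type: (Tensor) -> Tensor
        for layer in self.layers:  # unrolled at script time, one call per layer
            x = torch.tanh(layer(x))
        return x

scripted = torch.jit.script(StackedNet(3))
# scripted.graph shows three inlined layer applications, no layer loop
```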
Please review @kevinthesun @zhiics @MarisaKirisame @icemelon9 @jwfromm @wweic @alexwong
cc @tqchen @jroesch @ajtulloch @junrushao1994