[TVMC] Fix PyTorch support #7359
Conversation
A PyTorch model could not be compiled through tvmc because the shape of the input tensor could not be deduced from the model after it has been saved. We've added an --input-shape parameter to tvmc compile and tvmc tune that allows the inputs to be specified for PyTorch models.
```python
def parse_input_shapes(xs):
    """Turn the string from --input-shape into a list."""
```
It would be good to have an example here that describes the input format and the expected output format, similar to what you have in test_parse_input_shapes__turn_into_list.
```python
"--input-shape",
type=common.parse_input_shapes,
metavar="INPUT_SHAPE,[INPUT_SHAPE]...",
help="for PyTorch, e.g. '(1,3,224,224)'",
```
Maybe clarify that it is in fact mandatory for PyTorch.
Agree. It's confusing to see such a general option only for PyTorch. I would suggest the following changes:
- Make --input-shape a general option for all frontends. If present, we skip the input shape inference.
- --input-shape is optional by default. However, if users want to process a PyTorch model but don't specify --input-shape, we throw an error in the PyTorch frontend.
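A minimal sketch of the behavior proposed above, assuming an exception type like tvmc's TVMCException (the function name, signature, and stubbed inference here are illustrative, not the PR's actual code):

```python
class TVMCException(Exception):
    """Stand-in for tvmc's exception type (assumed name)."""


def resolve_input_shapes(frontend_name, input_shapes=None):
    """--input-shape is optional for all frontends, skips inference when
    given, and is mandatory for PyTorch (sketch of the suggested rule)."""
    if input_shapes is not None:
        # Shapes were supplied on the command line: skip inference entirely.
        return input_shapes
    if frontend_name == "pytorch":
        # PyTorch models lose their input shapes on torch.jit.save/load,
        # so inference is impossible and the flag becomes mandatory.
        raise TVMCException("--input-shape must be specified for PyTorch models")
    # Other frontends can infer shapes from the model itself (stubbed here).
    return "inferred-from-model"
```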
```python
# Remove white space and extract numbers
strshape = shape[1].replace(" ", "").split(",")
```
It would be safer and easier to remove all spaces in xs at the beginning of this function.
```python
try:
    shapes.append([int(i) for i in strshape])
except ValueError:
    raise argparse.ArgumentTypeError(f"expected numbers in shape '{shape[1]}'")
```
Consider the following two input shapes:
- `(8)`: `shapes=[8]`
- `(8,)`: ValueError, because `strshape` would be `[8, ""]`.

Accordingly, I guess your intention is `(8)` instead of `(8,)`. However, this is inconsistent with Python syntax, so it might confuse people. I have two proposals to deal with this:
1. Use list syntax instead of tuple syntax, so that the semantics are clear, and we can simply use the JSON loader to deal with all variants (e.g., spaces):

```python
xs = "[1,3,224,224], [32]"
# Wrap in brackets so the comma-separated shapes form one JSON list.
shapes = json.loads("[" + xs + "]")  # [[1, 3, 224, 224], [32]]
```

2. Follow Python syntax to accept only `(8,)` and throw an error for `(8)`, which is treated as an integer instead of a tuple because the parentheses are simplified away by Python. In this case, I would suggest using `eval` to deal with all variants:

```python
xs = "(1,3,224,224), (32,)"
# Remember to disable all local and global symbols to isolate this expression.
shapes = eval(xs, {}, {})  # ((1, 3, 224, 224), (32,))
```

Either way is fine for me; please update the help message and make sure you have a unit test to cover the corner cases.
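To make the first proposal concrete, here is a hedged sketch of what `parse_input_shapes` could look like with the JSON-based approach (the function name comes from the PR; the validation details and error messages are assumptions):

```python
import argparse
import json


def parse_input_shapes(xs):
    """Parse an --input-shape string such as "[1,3,224,224], [32]"
    into a list of shape lists, using JSON list syntax."""
    try:
        # Wrap in brackets so multiple comma-separated shapes form one JSON list.
        shapes = json.loads("[" + xs + "]")
    except json.JSONDecodeError:
        raise argparse.ArgumentTypeError(f"invalid shape string '{xs}'")
    for shape in shapes:
        # Each shape must itself be a list of integer dimensions.
        if not isinstance(shape, list) or not all(isinstance(d, int) for d in shape):
            raise argparse.ArgumentTypeError(f"expected a list of integers, got '{shape}'")
    return shapes
```

Because the JSON parser handles whitespace itself, this also sidesteps the space-stripping concern raised earlier in the review.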
```python
if input_shape:
    raise TVMCException("--input-shape is not supported for {}".format(self.name()))
```
This is definitely too ad hoc.
```python
# pylint: disable=C0415
import torch

traced_model = torch.jit.load(path)

inputs = list(traced_model.graph.inputs())[1:]
```
Is this approach not working at all? If it works for some cases, we should still use it first when --input-shape is missing.
I looked into this, and I didn't find a way to extract the inputs from the model after it has been saved and loaded. I asked on the PyTorch forum as well (https://discuss.pytorch.org/t/input-size-disappears-between-torch-jit-save-and-torch-jit-load/108955), and since I received a grand total of zero responses, I suspect it is a deliberate design decision. If there were a way, it would be good to keep it, of course, but in this form it no longer works.
```diff
@@ -389,6 +403,8 @@ def load_model(path, model_format=None):
     model_format : str, optional
         The underlying framework used to create the model.
         If not specified, this will be inferred from the file type.
+    input_shape : list, optional
+        The shape of the input tensor for PyTorch models.
```
Ditto. Make it general instead of only for PyTorch.
Include the functionalities in #7366. |