[Torch][Quantized] Fix converting serialized quantized models #5839
This is a workaround for the issue reported in pytorch/pytorch#39690.

In short, when a quantized PyTorch model is serialized and loaded back, the dtypes of its output tensors are dropped, and the loaded model has no `QUInt8` types at all. This becomes a problem when converting some Torch ops. For example, the output dtype of `aten::quantize_per_tensor` becomes `Tensor` (i.e., a float tensor), which is wrong, so `aten::adaptive_avg_pool2d` treats it as a float operation. But the output of `aten::quantize_per_tensor` should obviously be a quantized tensor, so `aten::adaptive_avg_pool2d` has to be converted to its quantized version.
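To make the dtype loss concrete, here is a minimal repro sketch (the quantized resnet18 from torchvision and the `qresnet18.pt` path are illustrative; this snippet is not taken from the PR):

```python
# Repro sketch for the dtype loss (illustrative, not code from this PR).
import torch
import torchvision

# Quantized resnet18 from torchvision; "qresnet18.pt" is a throwaway path.
model = torchvision.models.quantization.resnet18(quantize=True).eval()
inp = torch.rand(1, 3, 224, 224)
traced = torch.jit.trace(model, inp)

torch.jit.save(traced, "qresnet18.pt")
loaded = torch.jit.load("qresnet18.pt")

# In the freshly traced graph, the output of aten::quantize_per_tensor is
# annotated as a quantized tensor; in the loaded graph, the same value is
# typed as plain "Tensor" (a float tensor), as reported in
# pytorch/pytorch#39690.
print(traced.inlined_graph)
print(loaded.inlined_graph)
```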
The quantized resnet in torchvision uses `aten::adaptive_avg_pool2d`, so right now, if we save the quantized resnet and load it back, we get garbage results.
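Since the graph's type annotations can't be trusted after deserialization, one plausible shape for the workaround is to re-derive which values are quantized by propagating from `aten::quantize_per_tensor` outputs through the graph. The sketch below is only an illustration of that idea; `infer_quantized_values` is a hypothetical helper, not necessarily what this PR implements:

```python
# Hypothetical helper sketching the workaround idea; not this PR's code.
def infer_quantized_values(graph):
    """Return debug names of values assumed to hold quantized tensors,
    ignoring the (lost) dtype annotations in a deserialized graph."""
    quantized = set()
    # TorchScript graph nodes come in topological order, so a single
    # forward pass suffices for this straight-line sketch (control-flow
    # blocks are ignored for simplicity).
    for node in graph.nodes():
        kind = node.kind()
        if kind == "aten::quantize_per_tensor":
            quantized.update(o.debugName() for o in node.outputs())
        elif kind == "aten::dequantize":
            continue  # output goes back to float
        elif any(i.debugName() in quantized for i in node.inputs()):
            # Dtype-preserving ops (e.g. adaptive_avg_pool2d) fed by a
            # quantized tensor produce a quantized tensor.
            quantized.update(o.debugName() for o in node.outputs())
    return quantized
```

With such a set in hand, the converter could check membership instead of the broken `Tensor` annotation when deciding whether `aten::adaptive_avg_pool2d` needs its quantized conversion.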
please review @siju-samuel @anijain2305
cc @jjohnson-arm