[ONNX] Initial work to import pre-quantized ONNX Models #7802

mbrookhart · 2021-04-06T18:27:08Z

This PR implements QuantizeLinear, DequantizeLinear, and DynamicQuantizeLinear (which is deprecated but included for completeness).
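For readers unfamiliar with the three operators, here is a rough pure-Python sketch of their arithmetic as the ONNX operator documentation describes it, limited to the per-tensor uint8 case. It is not the importer code from this PR: the function names, the list-based inputs, and the fallback for an all-zero input are assumptions made for illustration.

```python
def quantize_linear(xs, scale, zero_point=0, qmin=0, qmax=255):
    """QuantizeLinear: y = saturate(round(x / scale) + zero_point)."""
    # Python's round() rounds halves to even, which matches the ONNX rule.
    return [min(qmax, max(qmin, round(x / scale) + zero_point)) for x in xs]


def dequantize_linear(qs, scale, zero_point=0):
    """DequantizeLinear: x = (q - zero_point) * scale."""
    return [(q - zero_point) * scale for q in qs]


def dynamic_quantize_linear(xs, qmin=0, qmax=255):
    """DynamicQuantizeLinear: derive scale and zero point from the data."""
    # The range is widened to include 0 so that 0.0 is exactly representable.
    lo = min(0.0, min(xs))
    hi = max(0.0, max(xs))
    scale = (hi - lo) / (qmax - qmin)
    if scale == 0.0:
        # Assumption of this sketch only: avoid dividing by zero when every
        # input is 0. The operator documentation does not cover this case.
        scale = 1.0
    zero_point = min(qmax, max(qmin, round(qmin - lo / scale)))
    return quantize_linear(xs, scale, zero_point, qmin, qmax), scale, zero_point


if __name__ == "__main__":
    q, scale, zp = dynamic_quantize_linear([-1.0, 0.0, 1.0, 2.0])
    print(q, scale, zp)
    print(dequantize_linear(q, scale, zp))
```

A pre-quantized model stores the scale and zero point as graph inputs, so the importer only has to map each node onto the matching quantize or dequantize operation. DynamicQuantizeLinear is the exception, since it computes both values from the tensor at run time.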

jwfromm

Awesome, excited to see this in action.

masahi · 2021-04-06T19:23:01Z

tests/python/frontend/onnx/test_forward.py

@@ -4165,15 +4165,8 @@ def verify_cumsum(indata, axis, exclusive=0, reverse=0, type="float32"):
    "test_cumsum_2d_axis_0/",
    "test_cumsum_2d_axis_1/",
    "test_cumsum_2d_negative_axis/",
-    "test_dequantizelinear/",


Do we run these tests on CI?

mbrookhart

Yes, we're running all of the pre-serialized node tests that ship with ONNX against CPU now, except what's skipped in this list. Working on reducing what we skip, and I'll start enabling GPU soon.

masahi · 2021-04-07T03:39:13Z

thanks @mbrookhart @jwfromm

* Add QuantizeLinear and DequantizeLinear
* DynamicDequantizeLinear

Matthew added 2 commits April 6, 2021 11:38

Add QuantizeLinear and DequantizeLinear

0d77e05

DynamicDequantizeLinear

2b1866e

mbrookhart requested review from masahi and jwfromm April 6, 2021 18:27

jwfromm approved these changes Apr 6, 2021


masahi reviewed Apr 6, 2021


Merge branch 'main' into onnx_quant

92d5c76

masahi merged commit 2d3f781 into apache:main Apr 7, 2021

trevor-m pushed a commit to trevor-m/tvm that referenced this pull request May 6, 2021

[ONNX] Initial work to import pre-quantized ONNX Models (apache#7802)

2be3e88

* Add QuantizeLinear and DequantizeLinear
* DynamicDequantizeLinear

trevor-m pushed a commit to trevor-m/tvm that referenced this pull request May 6, 2021

[ONNX] Initial work to import pre-quantized ONNX Models (apache#7802)

566921c

* Add QuantizeLinear and DequantizeLinear
* DynamicDequantizeLinear

trevor-m pushed a commit to trevor-m/tvm that referenced this pull request May 6, 2021

[ONNX] Initial work to import pre-quantized ONNX Models (apache#7802)

0a43ad0

* Add QuantizeLinear and DequantizeLinear
* DynamicDequantizeLinear

trevor-m pushed a commit to neo-ai/tvm that referenced this pull request May 11, 2021

[ONNX] Initial work to import pre-quantized ONNX Models (apache#7802)

69e2448

* Add QuantizeLinear and DequantizeLinear
* DynamicDequantizeLinear

tmoreau89 mentioned this pull request Aug 24, 2021

[Tracking Issue][ONNX] Quantized operator support in ONNX importer #8838

Closed

junrushao mentioned this pull request Nov 1, 2021

Apache TVM v0.8 Release Note Candidate #9416

Closed
