-
Notifications
You must be signed in to change notification settings - Fork 3.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[ONNX] Initial work to import pre-quantized ONNX Models #7802
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Awesome, excited to see this in action.
@@ -4165,15 +4165,8 @@ def verify_cumsum(indata, axis, exclusive=0, reverse=0, type="float32"): | |||
"test_cumsum_2d_axis_0/", | |||
"test_cumsum_2d_axis_1/", | |||
"test_cumsum_2d_negative_axis/", | |||
"test_dequantizelinear/", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we run these tests on CI?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, we're running all of the pre-serialized node tests that ship with ONNX against CPU now, except what's skipped in this list. Working on reducing what we skip, and I'll start enabling GPU soon.
thanks @mbrookhart @jwfromm |
* Add QuantizeLinear and DequantizeLinear * DynamicDequantizeLinear
* Add QuantizeLinear and DequantizeLinear * DynamicDequantizeLinear
* Add QuantizeLinear and DequantizeLinear * DynamicDequantizeLinear
* Add QuantizeLinear and DequantizeLinear * DynamicDequantizeLinear
This PR implements QuantlizeLinear, DequantizeLinear, and DynamicQuantizeLinear (which is deprecated but included for completeness).