Update imagenet quantization script for MKL-DNN #14407

ciyongch · 2019-03-13T02:35:33Z

Description

This PR is to skip quantizing FullyConnected layers when its inputs are signed int8 and the subgraph backend is set to MKLDNN, due to the limitation in current version of MKL-DNN. This feature will be enabled when MKL-DNN is upgraded to 0.18 or higher.
For not using MKLDNN subgraph backend (in general mode), users can still use both int8 and uint8 quantized FullyConnected for CPU platform (in this case, int8 will go to IGEMM path, but not providing requantize/dequantize fusion feature).

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

[] The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
Changes are complete (i.e. I finished coding on this PR)
All changes have test coverage:
Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
Nightly tests are added for complicated/long-running ones (e.g. changing distributed kvstore)
Build tests will be added for build configuration changes (e.g. adding a new build option with NCCL)
Code is well-documented:
For user-facing API changes, API doc string has been updated.
For new C++ functions in header files, their functionalities and arguments are documented.
For new examples, README.md is added to explain the what the example does, the source of the dataset, expected performance on test set and reference to the original paper if applicable
Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
To the my best knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

Feature1, tests, (and when applicable, API doc)
Feature2, tests, (and when applicable, API doc)

Comments

If this change is a backward incompatible change, why must this change be made.
Interesting edge cases to note here
@pengzhao-intel @TaoLv

pengzhao-intel · 2019-03-13T03:11:18Z

example/quantization/imagenet_gen_qsym_mkldnn.py

 if exclude_first_conv:
 excluded_sym_names += ['conv0']
 elif args.model == 'imagenet1k-inception-bn':
 rgb_mean = '123.68,116.779,103.939'
 rgb_std = '1,1,1'
- excluded_sym_names += ['flatten']
+ # TODO, fc1 will be enabled after MKL-DNN 0.18


v0.18 will be available soon. @TaoLv is WIP for the upgrading.
Could we wait several days for v0.18?

For now, these three examples will fail due to limitation of quantized FullyConnected (as it's enabled in current script).
If v0.18 will be merged into master in a short time, then this would be fine.

karan6181 · 2019-03-13T18:28:31Z

@mxnet-label-bot add [Quantization, MKLDNN]

pengzhao-intel · 2019-03-18T06:02:33Z

@ciyongch 0.18 PR is merged. Please check if the problem is fixed. #13668

ciyongch · 2019-03-18T06:46:07Z

@pengzhao-intel , sounds great, there's some check in current quantized FullyConnected to enable signed int8 inputs needs to be removed. After that, the problem should be fixed. Another PR to enable this is working in progress :)

ciyongch · 2019-03-19T05:15:52Z

Since PR #14466 enables signed int8 inputs to quantized FullyConnected, no need to apply this patch. Close it now.

skip quantizing FullyConnceted when inputs are signed int8

da068a8

ciyongch requested a review from szha as a code owner March 13, 2019 02:35

pengzhao-intel reviewed Mar 13, 2019

View reviewed changes

marcoabreu added MKLDNN Quantization Issues/Feature Requests related to Quantization labels Mar 13, 2019

xinyu-intel mentioned this pull request Mar 19, 2019

[MKL-DNN] Enable s8 support for inner product and 3d input with flatten=false #14466

Merged

7 tasks

ciyongch closed this Mar 19, 2019

ciyongch deleted the update_imagenet_qsym branch May 22, 2019 02:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update imagenet quantization script for MKL-DNN #14407

Update imagenet quantization script for MKL-DNN #14407

ciyongch commented Mar 13, 2019

pengzhao-intel Mar 13, 2019

ciyongch Mar 13, 2019

karan6181 commented Mar 13, 2019

pengzhao-intel commented Mar 18, 2019 •

edited

Loading

ciyongch commented Mar 18, 2019

ciyongch commented Mar 19, 2019

Update imagenet quantization script for MKL-DNN #14407

Update imagenet quantization script for MKL-DNN #14407

Conversation

ciyongch commented Mar 13, 2019

Description

Checklist

Essentials

Changes

Comments

pengzhao-intel Mar 13, 2019

Choose a reason for hiding this comment

ciyongch Mar 13, 2019

Choose a reason for hiding this comment

karan6181 commented Mar 13, 2019

pengzhao-intel commented Mar 18, 2019 • edited Loading

ciyongch commented Mar 18, 2019

ciyongch commented Mar 19, 2019

pengzhao-intel commented Mar 18, 2019 •

edited

Loading