Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MXNet pre-quantized BERT #6039

Merged
merged 7 commits into from
Jul 20, 2020
Merged

MXNet pre-quantized BERT #6039

merged 7 commits into from
Jul 20, 2020

Conversation

anijain2305
Copy link
Contributor

@anijain2305 anijain2305 commented Jul 11, 2020

MXnet pre-quantized BERT model - https://gluon-nlp.mxnet.io/examples/sentence_embedding/bert.html#Quantize-the-model

Features added in this PR

  • Support for Tensor quantization for MXNet Dense operator
  • Support for Channel quantization for MXNet Dense operator
  • Adding channel wise support for dequantize op
  • Support softmax use_length for axis=-1

@anijain2305 anijain2305 marked this pull request as ready for review July 12, 2020 03:10
@anijain2305
Copy link
Contributor Author

@icemelon9 @eric-haibin-lin @shoubhik

Please review

@anijain2305
Copy link
Contributor Author

@icemelon9 Can you please manage this PR?

Copy link
Member

@icemelon icemelon left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@icemelon icemelon merged commit b95ac1f into apache:master Jul 20, 2020
@icemelon
Copy link
Member

Thanks @anijain2305

trevor-m pushed a commit to trevor-m/tvm that referenced this pull request Aug 26, 2020
* MXNet pre-quantized BERT

* Comments.

* Trigger.

* Retrigger CI

* Retrigger CI

* Retrigger CI

* Retrigger
trevor-m pushed a commit to trevor-m/tvm that referenced this pull request Aug 26, 2020
* MXNet pre-quantized BERT

* Comments.

* Trigger.

* Retrigger CI

* Retrigger CI

* Retrigger CI

* Retrigger
trevor-m pushed a commit to trevor-m/tvm that referenced this pull request Sep 2, 2020
* MXNet pre-quantized BERT

* Comments.

* Trigger.

* Retrigger CI

* Retrigger CI

* Retrigger CI

* Retrigger
trevor-m pushed a commit to neo-ai/tvm that referenced this pull request Sep 3, 2020
* MXNet pre-quantized BERT

* Comments.

* Trigger.

* Retrigger CI

* Retrigger CI

* Retrigger CI

* Retrigger
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants