
add tensorrt quantization #172

Merged
merged 13 commits into from
May 9, 2022
Conversation

Data-Iab
Collaborator

@Data-Iab Data-Iab commented Apr 22, 2022

  • New 💥:

    • TensorRT engines can now be built with int8 precision using Post-Training Quantization.
    • Four calibrators are available for quantization: MinMaxCalibrator, LegacyCalibrator, EntropyCalibrator, and EntropyCalibrator2.
    • Added a QuantizedModel interface that converts a model to its quantized counterpart for Quantization Aware Training.
  • Fixed 🔧:

    • The adapt-graph option is removed; the graph is now adapted once, right after it is exported from torch to ONNX.
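The four calibrators listed above correspond to TensorRT's built-in INT8 calibrator base classes (`IInt8MinMaxCalibrator`, `IInt8LegacyCalibrator`, `IInt8EntropyCalibrator`, `IInt8EntropyCalibrator2`), which differ in how they pick the activation range used to derive the int8 scale. As a minimal NumPy sketch of what min-max calibration computes (the function names are illustrative, not part of this PR's API):

```python
import numpy as np

def minmax_int8_scale(activations: np.ndarray) -> float:
    """Symmetric int8 scale from the absolute max of the observed
    activations, as a MinMax-style calibrator would compute it."""
    amax = float(np.abs(activations).max())
    return amax / 127.0

def quantize_int8(x: np.ndarray, scale: float) -> np.ndarray:
    """Quantize float values with the given scale: round to nearest,
    then clamp to the int8 range [-128, 127]."""
    return np.clip(np.round(x / scale), -128, 127).astype(np.int8)

# Calibrate on a batch of activations, then quantize with the result.
calib_batch = np.array([-6.0, -1.5, 0.0, 2.0, 6.35])
scale = minmax_int8_scale(calib_batch)   # 6.35 / 127 = 0.05
q = quantize_int8(calib_batch, scale)
```

The entropy calibrators instead search for a clipping threshold that minimizes the KL divergence between the float and quantized activation distributions, which usually tolerates outliers better than the raw min-max range.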

@Data-Iab Data-Iab self-assigned this Apr 22, 2022
@thibo73800 thibo73800 merged commit 4caa0d2 into master May 9, 2022
@thibo73800 thibo73800 deleted the dev branch May 9, 2022 15:48