ORT optimizer refactorization #294

echarlaix · 2022-07-13T17:38:36Z

Refactorization of ORTOptimizer

HuggingFaceDocBuilderDev · 2022-07-13T17:42:31Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

…e behavior

regisss

Great work @echarlaix 🔥
I just left a few minor comments regarding example READMEs.

examples/onnxruntime/quantization/question-answering/README.md

examples/onnxruntime/quantization/text-classification/README.md

examples/onnxruntime/quantization/token-classification/README.md

philschmid

LGTM! Awesome work 🔥✅

…ng with ORTModels

* Override export of ORTSeq2SeqTrainer * Do not force download by default in ORTModel (#356) * Update OnnxConfigWithLoss wrapper * ORT optimizer refactorization (#294) * Refactorization of ORTOptimizer * Refactorization of ORTModel * Adapt examples according to refactorization * Adapt tests * Fix style * Remove quantizer modification * Fix style * Apply modifications from #270 for quantizer and optimizer to have same behavior * Add test for optimization of Seq2Seq models * Fix style * Add ort config saving when optimizing a model * Add ort config saving when quantizing a model * Add tests * Fix style * Adapt optimization examples * Fix readme * Remove unused parameter * Adapt quantization examples * Fix quantized model and ort config saving * Add documentation * Add model configuration saving to simplify loading of optimized model * Fix style * Fix description * Fix quantization tests * Remove opset argument which is onnx config default opset when exporting with ORTModels * Fix import (#360) * Fix export of decoders * Add flag to export only decoders * Fix ORTTrainer inference ort subclass parsing * Fix filenames when empty suffix given (#363) * fix(optimization): handle empty file suffix * fix(quantization): handle empty file suffix * use pathlibfor save_dir * run test again * Update optimum/onnxruntime/quantization.py Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com> * ReRun test that failed because of cache (network) Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com> * Override the evaluation and prediction loop in ORTSeq2SeqTrainer * Fix documentation (#369) * fix class * Update optimization.mdx * Fix label smoother device prob * Fix lm_logits and labels dimension mismatch * Clean up Co-authored-by: fxmarty <9808326+fxmarty@users.noreply.github.com> Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com> Co-authored-by: Pierre Snell <ierezell@gmail.com> Co-authored-by: Pierre Snell <pierre.snell@botpress.com> Co-authored-by: Philipp Schmid <32632186+philschmid@users.noreply.github.com>

* Inference with ORTModel * Clean up unused imports * Replace Inference session by ort model * Inference with ORTModel * Clean up unused imports * Replace Inference session by ort model * Update modeling for custom tasks * Replace in evaluation_loop * refectoring prediction_loop * ORTSeq2SeqTrainer refactoring - Inference with ORTModel (#359) * Override export of ORTSeq2SeqTrainer * Do not force download by default in ORTModel (#356) * Update OnnxConfigWithLoss wrapper * ORT optimizer refactorization (#294) * Refactorization of ORTOptimizer * Refactorization of ORTModel * Adapt examples according to refactorization * Fix ORTTrainer inference ort subclass parsing * Replace datasets.load_metric by evaluate * Add summarization example * Enable ORT inference * Fix inference args * Mention ORT inference in READMEs * Remove repetitve code in Trainer * Update examples to trfrs 4.22.1 * Fix qa example prediction error * Update summarization/README.md * Fix logger consistency * Make readme consistent with trfrs * Put back onnx config with past and loss test

echarlaix added 7 commits July 13, 2022 19:29

Refactorization of ORTOptimizer

bef4e10

Refactorization of ORTModel

bf7f38f

Adapt examples according to refactorization

16da1cb

Adapt tests

bb8e100

Fix style

f744d18

Remove quantizer modification

3739019

Fix style

af56854

echarlaix requested a review from philschmid July 15, 2022 08:40

philschmid mentioned this pull request Jul 17, 2022

Refactor ORTQuantizer #270

Merged

Merge branch 'main' into ort-optimizer-refactorization

a2eb07c

JingyaHuang mentioned this pull request Aug 12, 2022

Improve the compatibility dealing with large ONNX proto in ORTOptimizer and ORTQuantizer #332

Merged

3 tasks

echarlaix added 17 commits August 22, 2022 14:31

Merge branch main into feature branch

8f1871f

Apply modifications from #270 for quantizer and optimizer to have sam…

93f56ff

…e behavior

Add test for optimization of Seq2Seq models

d4873f0

Fix style

e74f425

Add ort config saving when optimizing a model

8974aaa

Add ort config saving when quantizing a model

aef3aff

Add tests

85994e4

Fix style

58b0352

Adapt optimization examples

dcfbcbc

Fix readme

864680d

Remove unused parameter

720c05d

Change quantization approach to dynamic in readmes

d6720b5

Adapt quantization examples

c4013fd

Fix quantized model and ort config saving

d040ffc

Add documentation

c1988fb

Add model configuration saving to simplify loading of optimized model

51d0a96

Fix style

876ab55

echarlaix marked this pull request as ready for review August 23, 2022 10:57

regisss approved these changes Aug 23, 2022

View reviewed changes

examples/onnxruntime/quantization/question-answering/README.md Show resolved Hide resolved

examples/onnxruntime/quantization/text-classification/README.md Show resolved Hide resolved

examples/onnxruntime/quantization/token-classification/README.md Show resolved Hide resolved

echarlaix added 2 commits August 23, 2022 17:41

Fix readmes description

a949947

Fix quantization tests

ec3638a

echarlaix requested a review from michaelbenayoun August 23, 2022 15:45

philschmid approved these changes Aug 24, 2022

View reviewed changes

Remove opset argument which is onnx config default opset when exporti…

9ab642b

…ng with ORTModels

echarlaix merged commit fb7e303 into main Aug 24, 2022

echarlaix deleted the ort-optimizer-refactorization branch August 24, 2022 12:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ORT optimizer refactorization #294

ORT optimizer refactorization #294

echarlaix commented Jul 13, 2022

HuggingFaceDocBuilderDev commented Jul 13, 2022

regisss left a comment

philschmid left a comment

ORT optimizer refactorization #294

ORT optimizer refactorization #294

Conversation

echarlaix commented Jul 13, 2022

HuggingFaceDocBuilderDev commented Jul 13, 2022

regisss left a comment

Choose a reason for hiding this comment

philschmid left a comment

Choose a reason for hiding this comment