
ORTSeq2SeqTrainer refactoring - Inference with ORTModel #359

Merged

Conversation

JingyaHuang
Collaborator

What does this PR do?

  • Enable ONNX Runtime inference in ORTSeq2SeqTrainer by leveraging subclasses of ORTModel for sequence-to-sequence tasks (see the sketch below).
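As a rough illustration of the inference path this enables, here is a minimal sketch of running a seq2seq checkpoint through ONNX Runtime with `ORTModelForSeq2SeqLM`; the model id, prompt, and `from_transformers` export flag are illustrative assumptions, not part of this PR:

```python
from transformers import AutoTokenizer
from optimum.onnxruntime import ORTModelForSeq2SeqLM

# Export a seq2seq checkpoint to ONNX and load it with ONNX Runtime
# (model id and export flag are illustrative assumptions).
model_id = "t5-small"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = ORTModelForSeq2SeqLM.from_pretrained(model_id, from_transformers=True)

inputs = tokenizer("translate English to French: Hello, world!", return_tensors="pt")
generated = model.generate(**inputs)
print(tokenizer.decode(generated[0], skip_special_tokens=True))
```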

@JingyaHuang JingyaHuang changed the base branch from refactoring-orttrainer-inf to main August 24, 2022 08:35
@JingyaHuang JingyaHuang changed the base branch from main to refactoring-orttrainer-inf August 24, 2022 08:35
fxmarty and others added 13 commits August 24, 2022 12:12
* Refactorization of ORTOptimizer

* Refactorization of ORTModel

* Adapt examples according to refactorization

* Adapt tests

* Fix style

* Remove quantizer modification

* Fix style

* Apply modifications from #270 for quantizer and optimizer to have same behavior

* Add test for optimization of Seq2Seq models

* Fix style

* Add ort config saving when optimizing a model

* Add ort config saving when quantizing a model

* Add tests

* Fix style

* Adapt optimization examples

* Fix readme

* Remove unused parameter

* Adapt quantization examples

* Fix quantized model and ort config saving

* Add documentation

* Add model configuration saving to simplify loading of optimized model

* Fix style

* Fix description

* Fix quantization tests

* Remove opset argument which is onnx config default opset when exporting with ORTModels
* fix(optimization): handle empty file suffix

* fix(quantization): handle empty file suffix

* use pathlib for save_dir

* run test again

* Update optimum/onnxruntime/quantization.py

Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>

* ReRun test that failed because of cache (network)

Co-authored-by: Ella Charlaix <80481427+echarlaix@users.noreply.github.com>
* fix class

* Update optimization.mdx
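For context on the optimizer refactoring and ORT config saving listed above, a minimal sketch of the refactored `ORTOptimizer` flow might look like the following; the model id, save directory, and optimization level are assumptions, not taken from this PR:

```python
from optimum.onnxruntime import ORTModelForSequenceClassification, ORTOptimizer
from optimum.onnxruntime.configuration import OptimizationConfig

# Load an ONNX Runtime model, then optimize it; the ORT config is saved
# alongside the optimized model (paths and settings below are illustrative).
model = ORTModelForSequenceClassification.from_pretrained(
    "distilbert-base-uncased-finetuned-sst-2-english", from_transformers=True
)
optimizer = ORTOptimizer.from_pretrained(model)
optimization_config = OptimizationConfig(optimization_level=1)
optimizer.optimize(save_dir="optimized_model", optimization_config=optimization_config)
```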
@JingyaHuang JingyaHuang marked this pull request as ready for review September 6, 2022 22:22
@JingyaHuang JingyaHuang merged commit 9a925c8 into refactoring-orttrainer-inf Sep 7, 2022
@JingyaHuang JingyaHuang deleted the refactoring-ort-seq2seqtrainer branch September 7, 2022 08:22
JingyaHuang added a commit that referenced this pull request Sep 19, 2022
…189)

* Inference with ORTModel

* Clean up unused imports

* Replace Inference session by ort model

* Inference with ORTModel

* Replace in evaluation_loop

* refactoring prediction_loop

* ORTSeq2SeqTrainer refactoring - Inference with ORTModel (#359)

* Override export of ORTSeq2SeqTrainer

* Update OnnxConfigWithLoss wrapper

* Fix ORTTrainer inference ort subclass parsing

* Override the evaluation and prediction loop in ORTSeq2SeqTrainer

* detect labels from input names

* Detect loss from output names

* Put back decoder with past

* Put past key values in the correct place

* remove if/else statement

* Revert the inference code
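The "detect labels from input names" and "Detect loss from output names" steps above amount to name-based lookups against the exported ONNX graph. A hypothetical sketch of that idea, not the actual trainer code (the file name and matching rules are assumptions):

```python
import onnxruntime as ort

# Hypothetical illustration: inspect an InferenceSession to decide which
# inputs carry labels and whether the graph already produces a loss output.
session = ort.InferenceSession("model_with_loss.onnx")

input_names = [inp.name for inp in session.get_inputs()]
output_names = [out.name for out in session.get_outputs()]

label_names = [name for name in input_names if "label" in name]
has_loss_output = any("loss" in name for name in output_names)
```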
JingyaHuang added a commit that referenced this pull request Oct 2, 2022
* Inference with ORTModel

* Clean up unused imports

* Replace Inference session by ort model

* Inference with ORTModel

* Clean up unused imports

* Replace Inference session by ort model

* Update modeling for custom tasks

* Replace in evaluation_loop

* refactoring prediction_loop

* ORTSeq2SeqTrainer refactoring - Inference with ORTModel (#359)

* Override export of ORTSeq2SeqTrainer

* Do not force download by default in ORTModel (#356)

* Update OnnxConfigWithLoss wrapper

* ORT optimizer refactorization (#294)

* Refactorization of ORTOptimizer

* Refactorization of ORTModel

* Adapt examples according to refactorization

* Fix ORTTrainer inference ort subclass parsing

* Replace datasets.load_metric by evaluate

* Add summarization example

* Enable ORT inference

* Fix inference args

* Mention ORT inference in READMEs

* Remove repetitive code in Trainer

* Update examples to trfrs 4.22.1

* Fix qa example prediction error

* Update summarization/README.md

* Fix logger consistency

* Make readme consistent with trfrs

* Put back onnx config with past and loss test
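The "Replace datasets.load_metric by evaluate" and "Add summarization example" commits point at the standalone evaluate library; a minimal sketch of computing ROUGE that way (the predictions and references below are placeholder strings):

```python
import evaluate

# Load ROUGE through the evaluate library instead of datasets.load_metric
# (inputs below are placeholders).
rouge = evaluate.load("rouge")
results = rouge.compute(
    predictions=["the cat sat on the mat"],
    references=["a cat was sitting on the mat"],
)
print(results)
```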