[RFC] Laying down building stone for more flexible ONNX export capabilities #11786
Conversation
Example of potential command line to export
Late to the party, but just a suggestion: why doesn't OnnxConfig take the real Config object as an argument? It would make all the string-based lookups like $config.hidden_size unnecessary, no?
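A minimal sketch of the difference being suggested, assuming a string template of the form "$config.hidden_size" resolved via getattr (the DummyConfig class and resolve helper are hypothetical, for illustration only):

```python
from dataclasses import dataclass

# Hypothetical stand-in for transformers.PretrainedConfig.
@dataclass
class DummyConfig:
    hidden_size: int = 768

def resolve(template: str, config: DummyConfig) -> int:
    # String-based indirection: store "$config.hidden_size" and look
    # the attribute up later on the real config object.
    prefix = "$config."
    assert template.startswith(prefix)
    return getattr(config, template[len(prefix):])

config = DummyConfig()
# With the real config object in hand, the lookup becomes direct:
assert resolve("$config.hidden_size", config) == config.hidden_size == 768
```

Passing the config object to OnnxConfig would let it read config.hidden_size directly instead of deferring through such string templates.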
See the contributed docs here: https://235542-155220641-gh.circle-artifacts.com/0/docs/_build/html/serialization.html
Idea: Rename the
wdyt?
That's a great idea!
@Narsil we moved forward on your suggestion, can you have a look (one more time 😄) 🙏🏻
This reverts commit f665efb.
Hello, when can we use transformers.onnx?
You already can when installing from source.
We'll do a release this week (probably Thursday or Friday), and it will then be available on PyPI.
[
    ("input_ids", {0: "batch", 1: "encoder_sequence"}),
    ("attention_mask", {0: "batch", 1: "encoder_sequence"}),
    ("decoder_input_ids", {0: "batch"}),
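These (name, {axis_index: axis_name}) pairs map directly onto the dynamic_axes argument of torch.onnx.export. A small sketch of that wiring (the export call itself is only commented, since it needs a real model and sample inputs):

```python
# (input_name, {axis_index: axis_name}) pairs as in the diff above.
inputs = [
    ("input_ids", {0: "batch", 1: "encoder_sequence"}),
    ("attention_mask", {0: "batch", 1: "encoder_sequence"}),
    ("decoder_input_ids", {0: "batch"}),
]

# torch.onnx.export expects parallel input_names and a dynamic_axes dict.
input_names = [name for name, _ in inputs]
dynamic_axes = {name: axes for name, axes in inputs}

# torch.onnx.export(model, sample_inputs, "model.onnx",
#                   input_names=input_names,
#                   dynamic_axes=dynamic_axes)

assert dynamic_axes["decoder_input_ids"] == {0: "batch"}
```

Axes not listed in dynamic_axes (here, axis 1 of decoder_input_ids) are baked into the exported graph as fixed sizes, which is what the comment below is about.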
I see that the decoder_input_ids length is fixed. I guess this makes sense when use_past is True, because we feed only one token (the one generated in the previous step). However, when use_past is False we need to feed all the previously generated tokens, don't we?
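A tiny sketch of the shape distinction raised above (the decoder_input_length helper is hypothetical, not part of the PR): with past key/values cached, each decoding step feeds only the newest token; without them, the full generated prefix must be fed.

```python
def decoder_input_length(generated_so_far: int, use_past: bool) -> int:
    # Hypothetical illustration of the sequence-axis size of
    # decoder_input_ids at one decoding step.
    return 1 if use_past else generated_so_far

assert decoder_input_length(10, use_past=True) == 1    # only the new token
assert decoder_input_length(10, use_past=False) == 10  # full prefix so far
```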
Hi, this thread is super important.
This PR aims at reworking the way the ONNX export tool works by introducing a static, checked description format that provides the ONNX exporters (PyTorch almost done, TF will follow) all the required knobs.
More specifically, this PR introduces the following concepts:
- OnnxConfig: a dataclass which a model must provide to be supported, describing all the properties needed to generate a proper export.
- OnnxVariable: a namedtuple which describes a variable w.r.t. its name, its shape, and potentially how many times it is "repeated" => useful for past_keys.
The test case was done initially for the BART model, without use_cache=True support. For the sake of completeness: dropping support for use_cache=True is currently needed because we have a doubly nested tuple at the core of the past_keys output structure, which would require multiple levels of dynamic axes, not currently supported by ONNX. This might be something we can work on in the future, potentially introducing an ONNX-compatible output structure that gets rid of the nested tuple layout and can be activated from a config property (to be discussed further later on).
Update 1:
- past_key_values for GPT2.
Supported models: