GPT-Neo ONNX export #12911
Conversation
@sgugger @LysandreJik What do you think would be the best way to approach these export features for downstream tasks? I think we have two possible ways:
…jects (draft version with lots of printing and comments, committed to have them available if need be)
…entionMixin._get_block_length_and_num_blocks in a graph-friendly fashion
@michaelbenayoun is the PR ready for review? 🥰
From what @sgugger said, I went with the "task argument" approach. Basically, a feature is the combination of a task and the potential use of past keys and values. I also implemented a "factory": any feature containing "-with-past" will be mapped by the factory to an OnnxConfig instantiated with past key/values enabled.

@mfuntowicz any comments on the changes I have made?
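For illustration, here is a minimal sketch of the feature-to-config factory described above; all names (`get_supported_features`, the listed tasks) are hypothetical and not the actual `transformers.onnx` API.

```python
# Hypothetical sketch of a feature -> OnnxConfig factory; names are illustrative.
from functools import partial


class OnnxConfig:
    def __init__(self, model_config, task="default", use_past=False):
        self.model_config = model_config
        self.task = task
        self.use_past = use_past


def get_supported_features(onnx_config_cls, tasks=("default", "causal-lm", "sequence-classification")):
    """Map every feature name ("<task>" or "<task>-with-past") to a config constructor."""
    features = {}
    for task in tasks:
        features[task] = partial(onnx_config_cls, task=task)
        # Any "-with-past" feature maps to a config with past key/values enabled.
        features[f"{task}-with-past"] = partial(onnx_config_cls, task=task, use_past=True)
    return features


# Usage: pick the constructor for the requested feature, then instantiate it.
feature_map = get_supported_features(OnnxConfig)
onnx_config = feature_map["causal-lm-with-past"](model_config=None)
assert onnx_config.task == "causal-lm" and onnx_config.use_past
```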
```diff
@@ -1121,7 +1121,7 @@ def forward(
             f"unexpected if using padding tokens in conjunction with `inputs_embeds.`"
         )

-        pooled_logits = logits[range(batch_size), sequence_lengths]
+        pooled_logits = logits[torch.arange(batch_size), sequence_lengths]
```
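For context, a small standalone reproduction of the indexing pattern in the changed line; the shapes are made up for illustration. The motivation for switching from Python `range` to `torch.arange` is, as I understand it, that the index then appears as a tensor operation the ONNX tracer can record, rather than a Python object evaluated at trace time.

```python
import torch

batch_size, seq_len, num_labels = 2, 5, 3
logits = torch.randn(batch_size, seq_len, num_labels)
sequence_lengths = torch.tensor([4, 2])  # last non-padding position per sample

# Advanced indexing: pick, for each sample, the logits at its last token.
pooled_logits = logits[torch.arange(batch_size), sequence_lengths]
print(pooled_logits.shape)  # torch.Size([2, 3])
```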
I think you can use `torch.take_along_dim(logits, sequence_lengths, dim=1)`. You might need to match the shapes of `logits` and `sequence_lengths`. It would remove the need to gather from the shape object.
I applied the following changes:

```python
if isinstance(sequence_lengths, torch.Tensor):
    pooled_logits = torch.take_along_dim(
        logits, indices=sequence_lengths.unsqueeze(1).unsqueeze(1), dim=1
    ).squeeze()
else:
    pooled_logits = logits[torch.arange(batch_size), sequence_lengths]
```
The forward pass works and gives the same output as the original version, but the ONNX conversion fails with:

RuntimeError: Exporting the operator take_along_dim to ONNX opset version 11 is not supported. Please feel free to request support or submit a pull request on PyTorch GitHub.
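A possible workaround, sketched below rather than taken from this PR, is `torch.gather`, which the ONNX exporter supports at opset 11 and which selects the same elements here:

```python
import torch

batch_size, seq_len, num_labels = 2, 5, 3
logits = torch.randn(batch_size, seq_len, num_labels)
sequence_lengths = torch.tensor([4, 2])

# Expand the per-sample index to (batch, 1, num_labels) so gather picks one
# time step per sample along dim=1; equivalent to take_along_dim in this case.
index = sequence_lengths.view(-1, 1, 1).expand(-1, 1, logits.size(-1))
pooled_logits = torch.gather(logits, dim=1, index=index).squeeze(1)

assert torch.equal(pooled_logits, logits[torch.arange(batch_size), sequence_lengths])
```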
LGTM, thanks a lot for the PR!
Nice! It requires a bit more reading to understand what to add when manually adding a new configuration. It would be nice to add the ONNX part to the existing templates so that when adding new models, users would automatically make them compatible with ONNX (not in this PR though).
LGTM!
GPT-Neo ONNX export and task / feature refactoring
Authored-by: Michael Benayoun <michael@huggingface.co>
What does this PR do?
This PR enables the export of GPT-Neo to ONNX by extending the new transformers.onnx module.
It also provides a possible way of implementing the export for specific tasks: the task can be specified when instantiating an OnnxConfig. This is a nice approach because it makes it easy to factor most of the inputs / outputs code, but it is less aligned with the transformers DNA than having subclasses (such as OnnxConfigForSequenceClassification, etc.) take care of that.
The issue with having many subclasses is that they would have to be written every time one wants to add support for a model.
What do you think?
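To make the trade-off concrete, here is a rough sketch of the two options; the class names, constructor, and outputs are hypothetical, not the merged implementation.

```python
# Hypothetical sketch of the two designs discussed above; not the actual API.
from collections import OrderedDict


# Option 1: a single config parameterized by the task.
class GPTNeoOnnxConfig:
    def __init__(self, model_config, task="default"):
        self.model_config = model_config
        self.task = task

    @property
    def outputs(self):
        # The task argument selects the right outputs, so most of the
        # inputs / outputs code stays factored in one place.
        if self.task == "sequence-classification":
            return OrderedDict({"logits": {0: "batch"}})
        return OrderedDict({"logits": {0: "batch", 1: "sequence"}})


# Option 2: one subclass per task, closer to the usual transformers layout,
# but it has to be written again for every newly supported model.
class GPTNeoOnnxConfigForSequenceClassification(GPTNeoOnnxConfig):
    @property
    def outputs(self):
        return OrderedDict({"logits": {0: "batch"}})
```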