Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

add Bigbird ONNX config #16427

Merged
merged 3 commits into from
Apr 12, 2022
Merged

add Bigbird ONNX config #16427

merged 3 commits into from
Apr 12, 2022

Conversation

vumichien
Copy link
Contributor

What does this PR do?

Add Bigbird OnnxConfig to make this model available for conversion.

Who can review?

@lewtun @LysandreJik

@HuggingFaceDocBuilderDev
Copy link

HuggingFaceDocBuilderDev commented Mar 26, 2022

The documentation is not available anymore as the PR was closed or merged.

@ManuchehrBoudesh
Copy link

5040f17eba15504bad66b14a645bddd9b015ebb7 #15622

@lewtun lewtun assigned lewtun and unassigned lewtun Apr 11, 2022
Copy link
Member

@lewtun lewtun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for this very clean implementation @vumichien 🚀 ! I have a small question about the changes to the modeling file, but this looks good to me 😄

Could you please fix the merge conflicts with the serialization.mdx file and check that the slow tests pass by running:

RUN_SLOW=1 pytest tests/onnx/test_onnx_v2.py -k "bigbird"

@@ -2994,7 +2994,7 @@ def forward(
# setting lengths logits to `-inf`
logits_mask = self.prepare_question_mask(question_lengths, seqlen)
if token_type_ids is None:
token_type_ids = (~logits_mask).long()
token_type_ids = torch.ones(logits_mask.size(), dtype=int) - logits_mask
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are these changes needed to enable inference with ONNX Runtime? Just trying to understand the need to change the modeling code

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@lewtun Yes, you are right. I have changed the type of logits_mask here from boolean to integer to enable inference with ONNX Runtime. The error looks like this if we use the original modeling code.

RuntimeError: 0INTERNAL ASSERT FAILED at "../torch/csrc/jit/ir/alias_analysis.cpp":611, please report a bug to PyTorch. We don't have an op for aten::fill_ but it isn't a special case.  Argument types: Tensor, bool, 

Candidates:
    aten::fill_.Scalar(Tensor(a!) self, Scalar value) -> (Tensor(a!))
    aten::fill_.Tensor(Tensor(a!) self, Tensor value) -> (Tensor(a!))

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the clarification! This looks fine to me :)

src/transformers/onnx/features.py Outdated Show resolved Hide resolved
@@ -3057,5 +3057,5 @@ def prepare_question_mask(q_lengths: torch.Tensor, maxlen: int):
# q_lengths -> (bz, 1)
mask = torch.arange(0, maxlen).to(q_lengths.device)
mask.unsqueeze_(0) # -> (1, maxlen)
mask = mask < q_lengths
mask = torch.where(mask < q_lengths, 1, 0)
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also in here, I have to change the mask type

@vumichien
Copy link
Contributor Author

@lewtun Thank you for reviewing my code. I have fixed the merge conflicts and all tests by running this RUN_SLOW=1 pytest tests/onnx/test_onnx_v2.py -k "bigbird" was passed.

@lewtun
Copy link
Member

lewtun commented Apr 12, 2022

This all looks good to me @vumichien - thanks for iterating! Gently pinging @LysandreJik or @sgugger for final approval

Copy link
Collaborator

@sgugger sgugger left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for your PR!

@lewtun lewtun merged commit 9c9db75 into huggingface:main Apr 12, 2022
elusenji pushed a commit to elusenji/transformers that referenced this pull request Jun 12, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants