fix ONNX support for bloom #18456
Merged
Conversation
Rationale for the change: `seq_length` could be a tensor, and we don't want to change its value in place.
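A minimal sketch of the pattern behind that rationale (the function and variable names are illustrative, not the exact diff): under ONNX tracing, `seq_length` may arrive as a 0-d tensor rather than a Python int, so the safe pattern is to derive a new variable instead of mutating it.

```python
import torch

def extended_length(seq_length, past_key_values_length):
    # Under ONNX tracing, `seq_length` may be a 0-d tensor rather than a
    # Python int; `seq_length += past_key_values_length` would then mutate
    # it in place and change the value other parts of the graph see.
    # Deriving a new variable keeps the original intact.
    return seq_length + past_key_values_length

# Works the same whether the length is an int or a traced tensor.
print(extended_length(5, 3))                # 8
print(extended_length(torch.tensor(5), 3))  # tensor(8)
```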
thomasw21 approved these changes on Aug 3, 2022.
Thanks! LGTM with a tiny little nit.
The documentation is not available anymore as the PR was closed or merged.
thomasw21 reviewed on Aug 3, 2022.
thomasw21 added a commit that referenced this pull request on Aug 4, 2022:
* Cleanup some code
* Improve signatures
* Try to reduce the number of reshape/copies
* I don't think we actually need the layer_num scaling trick
* No need for duplication
* Try to fix beam_search
* Fix beam search
* Removing layer num normalization seems to be breaking
* Not sure self.layer_number normalization actually matters
* Try and be backward compatible
* Try to fix beam_search
* Revert attempt to be backward compatible
* Improve documentation on past_key_values format
* Optimize the device allocation in case of hidden_states in multiple devices
* No need to manually cast the values to a specific device
* Rename with long version of variables
* Improve type hinting
* Add comment that explains that some methods return views
* Actually i think the attention casting only makes sense when we use torch.float16
* We don't actually need layer_number to be passed anymore
* Fix FX test
* Bypass torch.baddbmm
* Apply suggestions from code review
* Add comment about support for torchScript v1.11
* fix ONNX support for bloom (#18456)

Co-authored-by: Niklas Muennighoff <n.muennighoff@gmail.com>
Co-authored-by: Nouamane Tazi <nouamane98@gmail.com>
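The "Bypass torch.baddbmm" item above refers to how BLOOM's attention folds the ALiBi bias into the query–key product. A hedged sketch of what `torch.baddbmm` computes in that setting, and its unfused equivalent (the shapes, names, and scaling factor below are illustrative, not the actual modeling code):

```python
import torch

# Illustrative shapes only: q is (batch * num_heads, seq_len, head_dim),
# k is (batch * num_heads, head_dim, seq_len), and the ALiBi bias is
# broadcast to (batch * num_heads, seq_len, seq_len).
q = torch.randn(8, 16, 64)
k = torch.randn(8, 64, 16)
alibi = torch.randn(8, 16, 16)
inv_norm_factor = 1.0 / (64 ** 0.5)

# torch.baddbmm fuses "beta * bias + alpha * (q @ k)" into one call.
scores = torch.baddbmm(alibi, q, k, beta=1.0, alpha=inv_norm_factor)

# Equivalent unfused form, which some export/TorchScript paths may handle
# more gracefully — presumably what "bypass" means here.
scores_unfused = alibi + inv_norm_factor * torch.bmm(q, k)
assert torch.allclose(scores, scores_unfused, atol=1e-4)
```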
oneraghavan pushed a commit to oneraghavan/transformers that referenced this pull request on Sep 26, 2022.
(The commit message is identical to the one above, with the cross-reference written as huggingface#18456.)
Merged via #18344.
This PR aims to fix the ONNX export of bloom. All of the relevant ONNX tests are passing.
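As a rough pointer for reproducing the export (not taken from this PR — the checkpoint name, feature, and output path below are illustrative assumptions), the `transformers.onnx` package of that era could be driven like this:

```python
from pathlib import Path

from transformers import AutoModel, AutoTokenizer
from transformers.onnx import FeaturesManager, export

# Illustrative checkpoint; any BLOOM checkpoint with ONNX support should work.
model_ckpt = "bigscience/bloom-560m"
tokenizer = AutoTokenizer.from_pretrained(model_ckpt)
model = AutoModel.from_pretrained(model_ckpt)

# Look up the OnnxConfig registered for this architecture/feature pair.
model_kind, onnx_config_cls = FeaturesManager.check_supported_model_or_raise(
    model, feature="default"
)
onnx_config = onnx_config_cls(model.config)

# Run the actual export; returns the matched input and output names.
onnx_inputs, onnx_outputs = export(
    preprocessor=tokenizer,
    model=model,
    config=onnx_config,
    opset=onnx_config.default_onnx_opset,
    output=Path("bloom.onnx"),
)

# CLI equivalent (same assumptions):
#   python -m transformers.onnx --model=bigscience/bloom-560m --feature=default onnx/
```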