
Causal attention hard-codes lower triangular matrices as uint8 #636

Closed
2 of 4 tasks
fxmarty opened this issue Dec 22, 2022 · 4 comments
Labels
onnx Related to the ONNX export

Comments

@fxmarty
Contributor

fxmarty commented Dec 22, 2022

System Info

- `optimum` version: 1.5.3.dev0
- `transformers` version: 4.25.1
- Platform: Linux-5.15.0-56-generic-x86_64-with-glibc2.35
- Python version: 3.9.12
- Huggingface_hub version: 0.12.0.dev0
- PyTorch version (GPU?): 1.13.1+cu117 (cuda available: True)
- Tensorflow version (GPU?): 2.9.1 (cuda available: True)

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

Not necessarily a bug per se. It is likely due to, e.g., https://github.com/huggingface/transformers/blob/4d10ffd50614ebcdfe7e99c0092740f0f7234923/src/transformers/models/gpt2/modeling_gpt2.py#L196. The same pattern appears in other models.
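For reference, a rough sketch of the pattern at the linked line (the class below is an illustrative stand-in paraphrased from the GPT-2 attention module, not the actual transformers code):

```python
import torch
import torch.nn as nn


class CausalSelfAttention(nn.Module):
    """Illustrative stand-in for the GPT-2 attention module."""

    def __init__(self, max_positions: int = 1024):
        super().__init__()
        # The causal mask is materialized once as a uint8 lower-triangular buffer
        # and stored in the module state; on ONNX export it ends up serialized
        # into the graph rather than being recomputed on the fly.
        self.register_buffer(
            "bias",
            torch.tril(torch.ones((max_positions, max_positions), dtype=torch.uint8)).view(
                1, 1, max_positions, max_positions
            ),
        )
```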

optimum-cli export onnx --model gpt2 --task causal-lm --for-ort gpt2_onnx

[screenshot of the exported model attached in the original issue]

Expected behavior

/

@fxmarty fxmarty added the onnx Related to the ONNX export label Dec 22, 2022
@fxmarty
Contributor Author

fxmarty commented Dec 22, 2022

Related: #627 #605

@fxmarty
Contributor Author

fxmarty commented Dec 29, 2022

@sgugger
Contributor

sgugger commented Dec 30, 2022

My best guess would be because it's very old code and PyTorch did not support torch.bool back then.

@fxmarty
Contributor Author

fxmarty commented Jan 5, 2023

Closing, as in any case PyTorch represents torch.bool on a byte (pytorch/pytorch#41571 (comment)), and ONNX does as well (checked via the initializer's ByteSize()), so storing the mask as bool would not change the exported size.
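A quick sketch to check both points (the per-element storage of torch.bool, and the serialized size of the exported initializers); the ONNX file path below is hypothetical and depends on the export output directory used above:

```python
import torch
import onnx

# A torch.bool tensor occupies one byte per element, same as uint8.
mask_u8 = torch.tril(torch.ones((4, 4), dtype=torch.uint8))
mask_bool = mask_u8.to(torch.bool)
print(mask_u8.element_size(), mask_bool.element_size())  # -> 1 1

# Compare serialized initializer sizes in the exported graph.
# "gpt2_onnx/decoder_model.onnx" is a hypothetical path; adjust to your export output.
model = onnx.load("gpt2_onnx/decoder_model.onnx")
for init in model.graph.initializer:
    if "bias" in init.name:
        # ByteSize() is the serialized protobuf size of this initializer in bytes.
        print(init.name, init.data_type, init.ByteSize())
```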

@fxmarty fxmarty closed this as completed Jan 5, 2023