
BART FP16 #3117

Closed
astariul opened this issue Mar 4, 2020 · 8 comments · Fixed by #3145

@astariul
Contributor

astariul commented Mar 4, 2020

🚀 Feature request

I would like to use BART in FP16 mode, but it seems impossible for now:

from transformers import AutoModelWithLMHead, AutoTokenizer, BartConfig

config = BartConfig(vocab_size=50264, output_past=True)
model = AutoModelWithLMHead.from_pretrained('bart-large-cnn', config=config).cuda().half()
tokenizer = AutoTokenizer.from_pretrained('bart-large-cnn')

ARTICLE_TO_SUMMARIZE = "My friends are cool but they eat too many carbs."
inputs = tokenizer.batch_encode_plus([ARTICLE_TO_SUMMARIZE], max_length=1024, return_tensors='pt')

# Fails inside the attention layer once the model weights are in half precision
generated_ids = model.generate(inputs['input_ids'].cuda(), attention_mask=inputs['attention_mask'].cuda(), num_beams=4, max_length=5)

File "/data/user/.venv/bartqg/lib/python3.6/site-packages/transformers/modeling_bart.py", line 647, in forward
attn_output = torch.bmm(attn_probs, v)
RuntimeError: Expected object of scalar type Float but got scalar type Half for argument #2 'mat2' in call to _th_bmm

@sshleifer Do you plan to implement an FP16-friendly version of BART?

sshleifer self-assigned this Mar 4, 2020
@sshleifer
Contributor

Not on my roadmap just yet, but I would definitely consider it if there were lots of demand. Since we only have inference code right now, the benefit seems marginal.

@astariul
Contributor Author

astariul commented Mar 5, 2020

@BramVanroy Should this issue be closed?

FP16 is not implemented yet, and the wontfix label is clear.

Still, keeping the issue open may make it easier for people to find it and signal their interest in FP16.

@thomwolf
Member

thomwolf commented Mar 5, 2020

Indeed, this should not be closed.

@sshleifer, we intend all the models to be compatible with FP16. This is the direction the field is going, and with Volta-level GPUs now widespread, there is less and less reason not to use mixed-precision fine-tuning (half the memory and significantly faster).
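
A quick standalone illustration of the "half the memory" part of that claim (plain PyTorch, not transformers code): the same tensor stored in float16 takes half the bytes of its float32 counterpart.

import torch

x32 = torch.randn(1024, 1024)               # float32 by default
x16 = x32.half()                            # same values cast to float16
print(x32.element_size() * x32.nelement())  # 4194304 bytes (~4 MB)
print(x16.element_size() * x16.nelement())  # 2097152 bytes (~2 MB)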

thomwolf reopened this Mar 5, 2020
stale bot removed the wontfix label Mar 5, 2020
@thomwolf
Member

thomwolf commented Mar 5, 2020

This can probably be fixed by changing the torch.float32 cast here to a cast to the dtype of attn_weights, as is done in the original fairseq code here.

Do you mind fixing this and testing it against the failing script posted in this issue, @sshleifer?
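
A minimal sketch of what that change could look like; the helper name and signature below are illustrative, not the actual modeling_bart.py code. The idea is to keep the softmax in float32 for numerical stability and then cast the result back to the dtype of attn_weights, so the following torch.bmm sees matching half-precision operands.

import torch
import torch.nn.functional as F

def stable_attn_probs(attn_weights, dropout_p=0.0, training=False):
    # Softmax in float32 for numerical stability under fp16 ...
    probs = F.softmax(attn_weights, dim=-1, dtype=torch.float32)
    # ... then cast back to the input dtype (float16 when the model was .half()'ed),
    # mirroring the fairseq approach, so torch.bmm(probs, v) gets matching dtypes.
    probs = probs.type_as(attn_weights)
    return F.dropout(probs, p=dropout_p, training=training)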

@sshleifer
Contributor

Yep, on it!

@easonnie
Contributor

easonnie commented Mar 5, 2020

Hi @sshleifer, thank you so much for your effort on BART. I ran into the same FP16 issue today. The current BART code can be trained (without FP16) using the run_glue script at https://github.com/huggingface/transformers/blob/master/examples/run_glue.py, so it would be really nice if FP16 training worked as well.
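
For reference, here is a rough sketch of the kind of FP16 fine-tuning step this would enable, using NVIDIA apex amp as run_glue.py did at the time. The model name, hyperparameters, and dummy batch are illustrative only, and whether it runs cleanly with BART depends on the attention fix discussed above.

import torch
from transformers import BartForSequenceClassification, BartTokenizer

try:
    from apex import amp  # apex must be installed separately for fp16 (O1) training
except ImportError:
    amp = None

tokenizer = BartTokenizer.from_pretrained("bart-large")
model = BartForSequenceClassification.from_pretrained("bart-large").cuda()
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-5)

if amp is not None:
    # O1 keeps fp32 master weights and patches whitelisted ops to run in fp16
    model, optimizer = amp.initialize(model, optimizer, opt_level="O1")

batch = tokenizer.batch_encode_plus(
    ["My friends are cool but they eat too many carbs."],
    max_length=128, return_tensors="pt",
)
labels = torch.tensor([0]).cuda()
outputs = model(
    batch["input_ids"].cuda(),
    attention_mask=batch["attention_mask"].cuda(),
    labels=labels,
)
loss = outputs[0]  # models returned tuples in transformers 2.x; the loss comes first

if amp is not None:
    # scale_loss guards against fp16 gradient underflow during backward()
    with amp.scale_loss(loss, optimizer) as scaled_loss:
        scaled_loss.backward()
else:
    loss.backward()
optimizer.step()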

sshleifer linked a pull request Mar 5, 2020 that will close this issue
@BramVanroy
Collaborator

My bad, I thought @sshleifer's wontfix label was a note that he wasn't planning to change anything and that no further updates would come, so I closed it. Will keep that in mind for the future.

@thomwolf
Member

thomwolf commented Mar 6, 2020

No problem.

@sshleifer for the moment, please ping me with a DM before adding "wontfix" labels to issues, thanks.

AOZMH mentioned this issue Mar 12, 2020