FEAT / Trainer: Add adamw 4bit optimizer #31865

SunMarc · 2024-07-09T14:15:50Z

What does this PR do ?

This PR adds the 4-bit optimizer from torchao library into HF Trainer. For now, it requires the main branch of torchao and torch >=2.3 (maybe we can wait a bit before merging). For those who wants to try, you can pass optim="adamw_torch_4bit" in TrainingArguments.

Since we already have the 8-bit optimizer from bnb that works well, i'm not adding it.

Related thread : https://x.com/marksaroufim/status/1809398186198593566

cc @muellerzr as you might be interested

HuggingFaceDocBuilderDev · 2024-07-09T14:36:12Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

muellerzr

Nice! LG2M, cc @msaroufim :)

msaroufim · 2024-07-09T15:05:33Z

There's also an AdamWFp8 btw and it's the fastest one we've found when the HW supports it https://github.com/pytorch/ao/tree/main/torchao/prototype/low_bit_optim#benchmarks

Also cc @gau-nernst this is very exciting!

amyeroberts

Thanks for adding!

SunMarc · 2024-07-10T11:39:50Z

There's also an AdamWFp8 btw and it's the fastest one we've found when the HW supports it https://github.com/pytorch/ao/tree/main/torchao/prototype/low_bit_optim#benchmarks

Nice ! I'll add it in a separate PR !

This reverts commit 25278e8.

msaroufim · 2024-08-07T22:59:31Z

Heads up @SunMarc we just released torchao 0.4! https://github.com/pytorch/ao/releases/tag/v0.4.0

SunMarc · 2024-08-08T11:58:07Z

Nice ! I'll merge it as soon as we merge the torchao quantization PR in transformers as there is some overlap !

* add 4bit optimizer * style * fix msg * style * add qgalore * Revert "add qgalore" This reverts commit 25278e8. * style * version check

SunMarc added 2 commits July 9, 2024 16:04

add 4bit optimizer

5e77ac6

style

6e0fd52

SunMarc requested review from amyeroberts and muellerzr July 9, 2024 14:20

muellerzr approved these changes Jul 9, 2024

View reviewed changes

amyeroberts approved these changes Jul 9, 2024

View reviewed changes

gau-nernst mentioned this pull request Jul 10, 2024

4bit Adam #30172

Open

SunMarc added 4 commits July 10, 2024 16:48

fix msg

ddb28e5

style

3dca6a2

add qgalore

25278e8

Revert "add qgalore"

11d69aa

This reverts commit 25278e8.

SunMarc added 7 commits August 19, 2024 16:37

Merge remote-tracking branch 'upstream/main' into add-4bit-optim

a02e73b

style

d87f1dc

version check

0c7c50d

Merge remote-tracking branch 'upstream/main' into add-4bit-optim

a076ef0

Merge remote-tracking branch 'upstream/main' into add-4bit-optim

d7590db

Merge remote-tracking branch 'upstream/main' into add-4bit-optim

7eab41a

Merge remote-tracking branch 'upstream/main' into add-4bit-optim

3cad397

SunMarc merged commit c42d264 into main Aug 22, 2024
24 checks passed

SunMarc deleted the add-4bit-optim branch August 22, 2024 13:07

msaroufim mentioned this pull request Aug 22, 2024

4 bit Adam should support non constant lr pytorch/ao#730

Closed

gau-nernst mentioned this pull request Aug 27, 2024

Low bit optimizers quality pytorch/ao#744

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FEAT / Trainer: Add adamw 4bit optimizer #31865

FEAT / Trainer: Add adamw 4bit optimizer #31865

SunMarc commented Jul 9, 2024 •

edited by muellerzr

Loading

HuggingFaceDocBuilderDev commented Jul 9, 2024

muellerzr left a comment

msaroufim commented Jul 9, 2024 •

edited

Loading

amyeroberts left a comment

SunMarc commented Jul 10, 2024

msaroufim commented Aug 7, 2024

SunMarc commented Aug 8, 2024

FEAT / Trainer: Add adamw 4bit optimizer #31865

FEAT / Trainer: Add adamw 4bit optimizer #31865

Conversation

SunMarc commented Jul 9, 2024 • edited by muellerzr Loading

What does this PR do ?

HuggingFaceDocBuilderDev commented Jul 9, 2024

muellerzr left a comment

Choose a reason for hiding this comment

msaroufim commented Jul 9, 2024 • edited Loading

amyeroberts left a comment

Choose a reason for hiding this comment

SunMarc commented Jul 10, 2024

msaroufim commented Aug 7, 2024

SunMarc commented Aug 8, 2024

SunMarc commented Jul 9, 2024 •

edited by muellerzr

Loading

msaroufim commented Jul 9, 2024 •

edited

Loading