
add support for optimum bettertransformers #92

Merged
winglian merged 15 commits into main from flash-optimum on Jun 14, 2023

Conversation

winglian (Collaborator) commented on May 27, 2023

https://pytorch.org/blog/out-of-the-box-acceleration/

Testing initial support for the GPT-NeoX architecture.
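For reference, this is roughly how Optimum's BetterTransformer fastpath gets applied to a loaded model. A minimal sketch rather than this PR's actual code; the pythia-12b checkpoint name is only an example of the GPT-NeoX architecture:

```python
import torch
from optimum.bettertransformer import BetterTransformer
from transformers import AutoModelForCausalLM

# Illustrative GPT-NeoX-architecture checkpoint; not necessarily what the PR tests.
model = AutoModelForCausalLM.from_pretrained(
    "EleutherAI/pythia-12b",
    torch_dtype=torch.float16,
)
# Swap supported modules for their fused "fastpath" implementations.
model = BetterTransformer.transform(model)
```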

NanoCode012 (Collaborator) commented on May 28, 2023

I checked that PyTorch link. This isn't a torch 2.0-only feature, right?

Edit: I also see a float16 option. Is that different from fp16?

winglian (Collaborator, Author)

So I had to add float16 as an option because there is currently no way to load the model as float16 without enabling automatic mixed precision, which kicks in when you pass fp16 or bf16 to the trainer.
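A sketch of the distinction being described here; the helper name and flag below are illustrative, not the PR's actual config fields:

```python
import torch
from transformers import AutoModelForCausalLM

def load_base_model(base_model: str, load_in_float16: bool = False):
    # Illustrative helper, not the PR's code. Passing fp16/bf16 to the
    # Trainer turns on automatic mixed precision at runtime; it does not
    # change how the weights are materialized. To actually hold the
    # weights in float16, torch_dtype must be set here at load time.
    dtype = torch.float16 if load_in_float16 else torch.float32
    return AutoModelForCausalLM.from_pretrained(base_model, torch_dtype=dtype)
```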

winglian (Collaborator, Author)

Might be easiest to warn or raise a ValueError if they are using `torch.__version__.split(".")[0] < 2`?
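A rough sketch of the kind of guard being proposed, assuming it would live in config validation; the exact placement and whether it warns or raises aren't settled in this thread:

```python
import torch

def check_torch_version_for_flash_optimum() -> None:
    # BetterTransformer's fastpath relies on torch 2.x, so fail fast
    # (or downgrade this to a warning) on older installs. Sketch only.
    major = int(torch.__version__.split(".", maxsplit=1)[0])
    if major < 2:
        raise ValueError(
            f"flash_optimum requires torch >= 2.0, but found torch {torch.__version__}"
        )
```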

winglian added the enhancement (New feature or request) label on May 28, 2023
NanoCode012 (Collaborator)

> Might be easiest to warn or raise a ValueError if they are using `torch.__version__.split(".")[0] < 2`?

I think this is good.

> So I had to add float16 as an option because there is currently no way to load the model as float16 without enabling automatic mixed precision, which kicks in when you pass fp16 or bf16 to the trainer.

I just worry this will be confusing. Could we just check for cfg.flash_optimum and handle it there, instead of creating a new config option?

winglian (Collaborator, Author)

What about cfg.flash_optimum_float32? That way we default to float16 when cfg.flash_optimum is set (the sane default), but if they really want float32, they can set it here.
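To make the proposal concrete, a sketch of the suggested default; both flash_optimum_float32 here and the flash_optimum_float16 alternative raised in the next comment are only proposed names at this point:

```python
from types import SimpleNamespace

import torch

def resolve_load_dtype(cfg) -> torch.dtype:
    # Proposed behaviour only: flash_optimum defaults the load dtype to
    # float16, and a (hypothetical) flash_optimum_float32 flag opts back
    # into float32. Neither name is final in this thread.
    if getattr(cfg, "flash_optimum", False):
        return torch.float32 if getattr(cfg, "flash_optimum_float32", False) else torch.float16
    return torch.float32

# Example: flash_optimum alone would give float16 under this proposal.
print(resolve_load_dtype(SimpleNamespace(flash_optimum=True)))  # torch.float16
```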

NanoCode012 (Collaborator) commented on May 28, 2023

> What about cfg.flash_optimum_float32? That way we default to float16 when cfg.flash_optimum is set (the sane default), but if they really want float32, they can set it here.

Hm, by default I think people would expect their model to be in float32. Would cfg.flash_optimum_float16 be better?

Edit: Could you also add it to the README, in case I forget to later?

winglian (Collaborator, Author) commented on Jun 9, 2023

> Hm, by default I think people would expect their model to be in float32. Would cfg.flash_optimum_float16 be better?

In doing some additional experiments, there are cases where I need to explicitly load in float16 even when I'm not using BetterTransformers.

NanoCode012 (Collaborator) left a review

I think some parts have been discussed before, but I'm not sure we decided on anything.

Files with review comments:
- examples/pythia-12b/config.yml (outdated)
- scripts/finetune.py (outdated)
- src/axolotl/utils/data.py (outdated)
- src/axolotl/utils/models.py
- src/axolotl/utils/trainer.py (outdated)
- src/axolotl/utils/validation.py
winglian merged commit 16bb627 into main on Jun 14, 2023
winglian deleted the flash-optimum branch on Jun 15, 2023 at 12:31
mkeoliya pushed a commit to mkeoliya/axolotl that referenced this pull request on Dec 15, 2023:

add support for optimum bettertransformers (…flash-optimum)
Labels: enhancement (New feature or request)

3 participants