
shift torch dynamo handling to accelerate #23168

Merged
15 commits merged into main on May 31, 2023

Conversation

pacman100 (Contributor)

What does this PR do?

  1. Shifts the torch dynamo handling to accelerate.
  2. Should be merged after move fsdp handling to accelerate #23158.
  3. No user-facing change. Users can now use `accelerate launch` for torch dynamo, e.g. the command below (a programmatic sketch follows the two commands):

```bash
accelerate launch --dynamo_backend=inductor ./examples/pytorch/text-classification/run_glue.py \
  --model_name_or_path bert-base-cased \
  --task_name $TASK_NAME \
  --do_train \
  --do_eval \
  --max_seq_length 128 \
  --per_device_train_batch_size 32 \
  --learning_rate 2e-5 \
  --num_train_epochs 3 \
  --output_dir ~/temp/$TASK_NAME/ \
  --fp16 \
  --overwrite_output_dir \
  --pad_to_max_length \
  --dataloader_drop_last
```

Existing usage like the following is unaffected:

```bash
python ./examples/pytorch/text-classification/run_glue.py \
  --model_name_or_path bert-base-cased \
  --task_name $TASK_NAME \
  --do_train \
  --do_eval \
  --max_seq_length 128 \
  --per_device_train_batch_size 32 \
  --learning_rate 2e-5 \
  --num_train_epochs 3 \
  --output_dir ~/temp/$TASK_NAME/ \
  --fp16 \
  --overwrite_output_dir \
  --torch_compile \
  --pad_to_max_length \
  --dataloader_drop_last
```
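For reference, here is a minimal sketch of the two code-level paths these commands map onto, assuming `Accelerator` accepts a `dynamo_backend` keyword and `TrainingArguments` exposes the `torch_compile`/`torch_compile_backend` fields (names may shift between accelerate/transformers versions; this is illustrative, not the PR's implementation):

```python
# Illustrative sketch only, not the PR's implementation; exact keyword names
# may differ between accelerate/transformers versions.

from accelerate import Accelerator
from transformers import TrainingArguments

# Path 1: plain accelerate, mirroring `accelerate launch --dynamo_backend=inductor`.
accelerator = Accelerator(dynamo_backend="inductor")
# model, optimizer, dataloader = accelerator.prepare(model, optimizer, dataloader)

# Path 2: the existing Trainer flags (`--torch_compile`), which this PR now
# routes through accelerate under the hood.
args = TrainingArguments(
    output_dir="out",                   # illustrative output path
    torch_compile=True,                 # same as passing --torch_compile on the CLI
    torch_compile_backend="inductor",   # same as --torch_compile_backend inductor
)
```

After this PR the Trainer hands its compile flags to the Accelerator it creates, so both paths should exercise the same accelerate-side dynamo handling.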

@pacman100 pacman100 requested review from sgugger and muellerzr May 5, 2023 12:11

HuggingFaceDocBuilderDev commented May 5, 2023

The documentation is not available anymore as the PR was closed or merged.

muellerzr (Contributor) left a comment:

Nice and clean, great!

sgugger (Collaborator) left a comment:

Thanks a lot!

@pacman100 pacman100 changed the base branch from smangrul/accelerate-fsdp-integrate to main May 10, 2023 04:46
@pacman100 pacman100 changed the base branch from main to smangrul/accelerate-fsdp-integrate May 10, 2023 04:46
Base automatically changed from smangrul/accelerate-fsdp-integrate to main May 31, 2023 08:40
@pacman100 pacman100 merged commit 03db591 into main May 31, 2023
@pacman100 pacman100 deleted the smangrul/accelerate-dynamo-integrate branch May 31, 2023 09:12
sheonhan pushed a commit to sheonhan/transformers that referenced this pull request Jun 1, 2023
* mixed precision support via accelerate

* fix issues

* fix for the sharded ddp case

* fix flax and tf failing tests

* refactor the place to create `Accelerator` object

* move ddp prep to accelerate

* fix 😅

* resolving comments

* move fsdp handling to accelerate

* fixes

* fix saving

* shift torch dynamo handling to accelerate
gojiteji pushed a commit to gojiteji/transformers that referenced this pull request Jun 5, 2023
novice03 pushed a commit to novice03/transformers that referenced this pull request Jun 23, 2023
Labels: none
Projects: none
4 participants