Removing the drop and gathers in depth tensor parallelism for the easy API #66

siddharth9820 · 2024-02-28T00:19:54Z

I realized that the drop and gather operations are unnecessary. The drop can instead happen inside the dataloader, provided the data is sharded correctly across the depth tensor parallel ranks. For Lightning or Accelerate (a PR making this change is at axonn-ai/accelerate#2), the backend will handle this sharding. For users who use AxoNN directly (as in improved-diffusion), we must provide an API for configuring their dataloaders.
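To illustrate what "sharding the dataloader correctly" could look like, here is a minimal sketch in plain Python. It is not AxoNN's API: the function names `sampler_args` and `shard_indices` are hypothetical, and the rank ordering (depth ranks contiguous within a data-parallel rank) is an assumption. The idea is that each (data-parallel rank, depth tensor parallel rank) pair is treated as its own data shard, so no rank loads samples it would later drop.

```python
def sampler_args(data_rank, depth_rank, num_data_ranks, num_depth_ranks):
    """Return (num_replicas, rank) for a sampler that shards data across
    both data-parallel and depth tensor parallel ranks.

    Hypothetical helper; the rank ordering below is an assumption.
    """
    num_replicas = num_data_ranks * num_depth_ranks
    # Assumed layout: depth ranks are contiguous within one data-parallel rank.
    rank = data_rank * num_depth_ranks + depth_rank
    return num_replicas, rank


def shard_indices(dataset_len, num_replicas, rank):
    """Strided split of sample indices, without shuffling or padding."""
    return list(range(rank, dataset_len, num_replicas))


if __name__ == "__main__":
    # 2 data-parallel ranks x 2 depth tensor parallel ranks, 8 samples.
    for data_rank in range(2):
        for depth_rank in range(2):
            n, r = sampler_args(data_rank, depth_rank, 2, 2)
            print((data_rank, depth_rank), shard_indices(8, n, r))
```

With PyTorch, the resulting pair could be passed as `num_replicas` and `rank` to `torch.utils.data.DistributedSampler`, so that each rank reads only its own slice of the dataset.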

remove dtp drop and gathers

7d1ceba

siddharth9820 changed the title ~~remove dtp drop and gathers~~ Removing the drop and gathers in depth tensor parallelism for the easy API Feb 28, 2024

siddharth9820 added 5 commits February 27, 2024 19:22

Merge branch 'develop' into easy-api-remove-dtp-comm

045a7ee

correct tests

afe8b8b

Merge branch 'easy-api-remove-dtp-comm' of github.com:axonn-ai/axonn into easy-api-remove-dtp-comm

686671a

correct small bug in ci

3c07267

use sync gradients

99be26e

This was referenced Feb 28, 2024

Change data partitioning huggingface/accelerate#2499

Closed

partition data across depth tensor parallel ranks axonn-ai/accelerate#2

Merged

siddharth9820 merged commit f975e58 into develop Feb 28, 2024
6 checks passed

siddharth9820 deleted the easy-api-remove-dtp-comm branch February 28, 2024 01:14

Avuxon pushed a commit that referenced this pull request Apr 12, 2024

Removing the drop and gathers in depth tensor parallelism for the easy API (#66)

7728f1a
