Support model size calculation for FSDP & DeepSpeed (stage-3) #10291

Open
rohitgr7 opened this issue Nov 1, 2021 · 0 comments
Labels: feature (Is an improvement or enhancement), strategy: deepspeed, strategy: fsdp (Fully Sharded Data Parallel)


rohitgr7 commented Nov 1, 2021

🚀 Feature

Currently LightningModule.model_size (deprecated) relies on:
https://github.com/PyTorchLightning/pytorch-lightning/blob/45c45dc7b018f9a2db60f5df1a3f7dbbb45ccb36/pytorch_lightning/utilities/memory.py#L165-L179

to calculate the model size. Since we are going to keep the get_model_size_mb utility function, we should have a way to calculate model sizes for sharded models, where each process holds only part of the parameters.
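One possible direction, as a rough sketch rather than a proposed implementation: sum the bytes of the tensors each process can see, then add those local sums across ranks. The helper names `local_size_bytes` and `sharded_model_size_mb` are hypothetical and not part of Lightning. The sketch assumes that `module.parameters()` and `module.buffers()` on each rank expose only that rank's shard. That may not hold for every strategy (DeepSpeed stage 3 in particular keeps its partitions elsewhere and would likely need its own handling), and replicated buffers or shard padding would be counted once per rank.

```python
import torch
import torch.distributed as dist
from torch import nn


def local_size_bytes(module: nn.Module) -> int:
    """Bytes of all parameters and buffers visible on this process."""
    tensors = list(module.parameters()) + list(module.buffers())
    return sum(t.numel() * t.element_size() for t in tensors)


def sharded_model_size_mb(module: nn.Module) -> float:
    """Hypothetical helper: model size in MB, summed over all ranks.

    Assumes each rank only sees its own shard. Without an initialized
    process group this is simply the local size.
    """
    # Put the counter on the same device as the parameters, since
    # some backends (e.g. NCCL) only reduce CUDA tensors.
    first = next(module.parameters(), None)
    device = first.device if first is not None else torch.device("cpu")
    total = torch.tensor(local_size_bytes(module), dtype=torch.int64, device=device)

    if dist.is_available() and dist.is_initialized():
        # Collective call: every rank has to reach this line.
        dist.all_reduce(total, op=dist.ReduceOp.SUM)

    return total.item() / 1e6


if __name__ == "__main__":
    # Single-process usage: falls back to the local size.
    model = nn.Linear(10, 5)
    print(f"{sharded_model_size_mb(model):.6f} MB")
```

Unlike a size based on serializing `state_dict()`, this counts raw tensor bytes only and also includes non-persistent buffers, so the two numbers can differ slightly even for an unsharded model.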

cc @Borda @awaelchli @carmocca @SeanNaren @rohitgr7 @akihironitta

