-
Notifications
You must be signed in to change notification settings - Fork 5.9k
【FlexCheckpoint】Add ShardedTensor, Upgrade DCP, and Introduce AOA #74593
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
【FlexCheckpoint】Add ShardedTensor, Upgrade DCP, and Introduce AOA #74593
Conversation
|
你的PR提交成功,感谢你对开源项目的贡献! |
7929113 to
1c91777
Compare
1c91777 to
391481c
Compare
Codecov Report❌ Patch coverage is ❌ Your patch status has failed because the patch coverage (46.74%) is below the target coverage (90.00%). You can increase the patch coverage or adjust the target coverage. Additional details and impacted files@@ Coverage Diff @@
## develop #74593 +/- ##
==========================================
Coverage ? 46.74%
==========================================
Files ? 13
Lines ? 1106
Branches ? 0
==========================================
Hits ? 517
Misses ? 589
Partials ? 0 ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
|
/re-run Distribute-stable |
From00
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM,一些建议可以在下个PR修改。
From00
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
risemeup1
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM for setup.py
sunzhongkai588
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM,
我看 ShardedTensor API 都暴露了,中文文档也补齐一下
zyfncg
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM for setup.py.in
e844a9d
4fe60f4 to
502d903
Compare
From00
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
XiaoguangHu01
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
sunzhongkai588
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM,文档问题后续补充
| "PrepareContextParallel", | ||
| "create_nccl_config", | ||
| "ShardedWeight", | ||
| "ShardedStateDict", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ShardedStateDict 似乎没有文档?
…ddlePaddle#74593) * add flex checkpoint * add aoa_engine test * replace left arrow with right arrow * fix api type check * fix __init__ * rename sharded_tensor to sharded_weight * fix path
PR Category
User Experience
PR Types
New features
Description
添加FlexCheckpoint主要模块,包括
pcard-73263