-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Completes basic dtypes for collective api in eager mode #45574
Completes basic dtypes for collective api in eager mode #45574
Conversation
你的PR提交成功,感谢你对开源项目的贡献! |
a28b1e1
to
d0105dd
Compare
""" | ||
if group is not None and not group.is_member(): | ||
return | ||
dst = _get_group_rank(dst, group) | ||
if in_dygraph_mode(): | ||
group = _get_default_group() if group is None else group | ||
backend = _group_map_backend[group] | ||
assert backend != 'gloo', ("backend gloo is not supported yet") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
最新的gloo里是支持send,recv和alltoall的,不过我们现在的gloo是野分支,后面升级之后可以一起改。
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
新版本gloo实现了alltoall,send和recv未实现,可以在后续一起修改
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
gloo似乎也没有实现reduce_scatter,后续可以一起改了
0bbdc91
to
f13e16d
Compare
python/paddle/fluid/tests/unittests/collective/test_collective_sendrecv_api.py
Show resolved
Hide resolved
15a0a17
to
da10aa2
Compare
…r broadcast, reduce, scatter op in eager mode
…r alltoall, sendrecv op in eager mode
…r alltoall_single, reduce_scatter op in eager mode
da10aa2
to
733efa6
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM for PROPERTIES TIMEOUT "300" LABELS "RUN_TYPE=DIST"
后续pr会降低单测执行时间
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
PR types
New features
PR changes
OPs
Describe
This pr completes the basic function of communication framework, support various data types.
通信框架功能进一步补全,通信操作支持传输丰富的数据类型。
TODO: