-
Notifications
You must be signed in to change notification settings - Fork 5.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
support KL2 multi-card training, *test=kunlun #43889
Changes from all commits
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -60,6 +60,7 @@ XPUOpMap& get_kl2_ops() { | |
XPUKernelSet({pOpKernelType(vartype::FP32, XPUPlace())})}, | ||
{"bilinear_interp_v2_grad", | ||
XPUKernelSet({pOpKernelType(vartype::FP32, XPUPlace())})}, | ||
{"broadcast", XPUKernelSet({pOpKernelType(vartype::FP32, XPUPlace())})}, | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. fluid/operator/下没看到具体的*_op_xpu.cc文件和对应单测文件,和调用broadcast的地方没看到,是不是可以不用加 There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. xpu调broadcast的地方在dygraph/layers.py:_dygraph_call_func() There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. 不需要加all_reduce吗? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. 暂时不需要,allreduce现在是直接调用bkcl的接口,没有封装 |
||
{"cast", | ||
XPUKernelSet({pOpKernelType(vartype::FP32, XPUPlace()), | ||
pOpKernelType(vartype::FP16, XPUPlace()), | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这个要求runtime和驱动版本比较新,才能支持xpu_event_record,对吧?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
xpu_event_record 我看20年就有了,3994660c (hanjinchen 2020-07-03