[NPU] support npu for memcpy op #31808

zhiqiu · 2021-03-23T07:09:16Z

PR types

New features

PR changes

OPs

Describe

[NPU] support npu for memcpy op

paddle-bot-old · 2021-03-23T07:09:18Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

pangyoki

LGTM

pangyoki · 2021-03-30T03:45:38Z

python/paddle/fluid/tests/unittests/npu/test_memcpy_op_npu.py

+                outputs={"Out": cpu_var_name},
+                attrs={
+                    "shape": [10, 10],
+                    "dtype": npu_var.dtype,


not important. cpu_var.dtype

liym27 · 2021-03-30T03:48:57Z

python/paddle/fluid/tests/unittests/npu/test_memcpy_op_npu.py

+        self.assertTrue(np.allclose(npu_, cpu_))
+        self.assertTrue(np.allclose(cpu_, np.ones((10, 10))))
+
+    def test_cpu_cpoy_gpu(self):


test_cpu_cpoy_gpu -> test_cpu_cpoy_npu

liym27

LGTM

* support npu for memcpy op * add ut * fix ut * fix typo

…to develop (#32294) * [NPU] support GarbageCollector for npu (#31874) * support GarbageCollector for npu * fix typo * fix gather_grad * disable NPUDefaultStreamGarbageCollector on NPU * [NPU] support npu for memcpy op (#31808) * support npu for memcpy op * add ut * fix ut * fix typo * 【NPU】fix bug of using temp vector (#31963) * fix bug when beta1_pow on cpu (#31995) * [NPU] support npu profiler (#31684) * support npu profiler * add python api * fix bugs * add wrapper for incomplete type * update profile proto * record npu wait * add xpu placeholder * fix adam (#32016) * [NPU] enable async copy and add wait before sync operation (#31956) * enable async copy and add wait before sync operation * remove unneccessary wait * add FillNpuTensorWithConstant * refine * fix fill_constant * make TensorFromVector/TensorToVector sync * [NPU] Support dataloader on npu place. (#31867) * [NPU] Wait on NPUPlace (#32086) * [NPU] fix cast op (#32121) * fix npu kernel of cast op to handle casting to same dtype * add comments * [NPU] support cann 20.3 (#32044) * fix compile problem on cann 20.3 * fix ut * fix test_mul * fix check_finite_and_scale * fix lookup_table_v2_grad * fix cmake * support print op * [NPU] Support npu save load (#31893) * support save load for NPU * add save load npu unittest * support np.array transform in NPU * fix errors * delete dygraph in unittest * add Wait * fix unittest * fix review comment * fix unittest problem * fix little problem * change aclrtSynchronizeDevice to aclrtSynchronizeStream for better performance (#32196) * change aclrtSynchronizeDevice to aclrtSynchronizeStream for better performace * refine code * fix NPUDeviceContext in all c++ unittest (#32198) * fix NPUDeviceContext in all c++ unittest * refine log Co-authored-by: pangyoki <pangyoki@126.com> * [NPU] Remove TensorFromVector and avoid sync copy in npu op kernel for better performance (#31994) * enable async copy and add wait before sync operation * remove unneccessary wait * add FillNpuTensorWithConstant * refine * fix fill_constant * change TensorFromVector to FillNpuTensorWithConstant * fix ignored api * delete extra unittest * fix little error * fix update_loss_scaling_op_npu and check_finite_and_unscale_op_npu * change TensorCopySync to TensorCopy * delete useless Wait and add StreamWait * fix npu_stream error * fix check_finite_and_unscale_op_npu TensorCopy * only save stream wait * fix NPUDeviceContext in all c++ unittest * delete wait Co-authored-by: zhiqiu <chenqiuliang@baidu.com> * delete useless unittest file (#32206) * Fix op test (#32231) * fix conditional block (#32243) * fix adam bug again (#32246) * fix compile * fix ut * fix ut Co-authored-by: liym27 <33742067+liym27@users.noreply.github.com> Co-authored-by: pangyoki <pangyoki@126.com>

support npu for memcpy op

d919666

zhiqiu added 2 commits March 23, 2021 07:18

add ut

75ed850

fix ut

f2ed485

pangyoki previously approved these changes Mar 30, 2021

View reviewed changes

liym27 reviewed Mar 30, 2021

View reviewed changes

fix typo

d60c144

zhiqiu dismissed pangyoki’s stale review via d60c144 March 30, 2021 04:00

liym27 approved these changes Mar 30, 2021

View reviewed changes

zhiqiu merged commit a6343af into PaddlePaddle:ascendrc Mar 30, 2021

zhiqiu added a commit to zhiqiu/Paddle that referenced this pull request Apr 15, 2021

[NPU] support npu for memcpy op (PaddlePaddle#31808)

f5c50b5

* support npu for memcpy op * add ut * fix ut * fix typo

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[NPU] support npu for memcpy op #31808

[NPU] support npu for memcpy op #31808

zhiqiu commented Mar 23, 2021 •

edited

Loading

paddle-bot-old bot commented Mar 23, 2021

pangyoki left a comment

pangyoki Mar 30, 2021

liym27 Mar 30, 2021

liym27 left a comment

[NPU] support npu for memcpy op #31808

[NPU] support npu for memcpy op #31808

Conversation

zhiqiu commented Mar 23, 2021 • edited Loading

PR types

PR changes

Describe

paddle-bot-old bot commented Mar 23, 2021

pangyoki left a comment

Choose a reason for hiding this comment

pangyoki Mar 30, 2021

Choose a reason for hiding this comment

liym27 Mar 30, 2021

Choose a reason for hiding this comment

liym27 left a comment

Choose a reason for hiding this comment

zhiqiu commented Mar 23, 2021 •

edited

Loading