Add is_mean param for mean op #40757

AnnaTrainingG · 2022-03-21T05:22:30Z

PR types

Others

PR changes

OPs

Describe

Add is_mean param for mean op
1.针对mean OP 添加is_mean参数，保证完成所以数据求和之后再进行除法操作。
2.修改reduceHigher的grid配置，当grid.z > 65536时候设置reduce_type为reduceAny
修改背景，1. fp16状态下，模型出现nan， 2，模型case[6600,:,:] 计算错误 axis = 1

paddle-bot-old · 2022-03-21T05:22:59Z

你的PR提交成功，感谢你对开源项目的贡献!
请关注后续CI自动化测试结果，详情请参考Paddle-CI手册。
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

ZzSean · 2022-03-21T06:24:02Z

paddle/fluid/operators/mean_op.cu

@@ -65,9 +65,10 @@ class MeanCUDAKernel : public framework::OpKernel<T> {
    for (decltype(rank) i = 0; i < rank; ++i) {
      reduce_dims.push_back(i);
    }
-    TensorReduceImpl<T, T, kernel_primitives::AddFunctor, Div>(
-        context.cuda_device_context(), *input, output, Div(numel), reduce_dims,


这里前面定义的Div是不是可以删掉了

已经删除了调用的是IdentityFunctor

ZzSean · 2022-03-21T06:26:36Z

paddle/phi/kernels/funcs/reduce_function.h

@@ -657,6 +658,9 @@ __global__ void ReduceAnyKernel(const Tx* x,
  // the last dim gets involved in reduction
  int store_offset = 0;
  int stride_left = 0;
+  auto Final =
+      is_mean ? kps::DivideFunctor<MPType> : kps::IdentityFunctor<MPType>;
+  auto final_opt = Final(reduce_num);


opt一般指的是optimize的缩写吧，如果这里含义是output建议直接用out或者output

不是out，是operat，进行除法操作，或者是直接返回，只针对最后的store数据进行操作

ZzSean · 2022-03-21T06:30:14Z

paddle/phi/kernels/funcs/reduce_function.h

@@ -657,6 +658,9 @@ __global__ void ReduceAnyKernel(const Tx* x,
  // the last dim gets involved in reduction
  int store_offset = 0;
  int stride_left = 0;
+  auto Final =
+      is_mean ? kps::DivideFunctor<MPType> : kps::IdentityFunctor<MPType>;


对于is_mean为false的情况，感觉不用再用IdentityFunctor算一遍了，只在最后把reduce_var除一下就可以

已经修改

zkh2016 · 2022-03-24T01:57:20Z

paddle/fluid/operators/reduce_ops/reduce_op.cu.h

@@ -33,12 +33,12 @@ void TensorReduceImpl(const platform::CUDADeviceContext& dev_ctx,
                      const framework::Tensor& x, framework::Tensor* y,
                      const TransformOp& transform,
                      const std::vector<int>& origin_reduce_dims,
-                      gpuStream_t stream) {
+                      gpuStream_t stream, bool is_mean = false) {


is_mean要不要加上const

好的下个PR再修改

ZzSean

LGTM

ZzSean reviewed Mar 21, 2022

View reviewed changes

AnnaTrainingG added 4 commits March 23, 2022 07:32

Add is_mean param for mean op

c62dda9

update

15988d6

update config

1060cd8

update function.h

87fb28d

AnnaTrainingG force-pushed the mean_nan branch from 25f5d89 to 87fb28d Compare March 23, 2022 13:34

zkh2016 reviewed Mar 24, 2022

View reviewed changes

zkh2016 approved these changes Mar 24, 2022

View reviewed changes

xingfeng01 approved these changes Mar 24, 2022

View reviewed changes

AnnaTrainingG merged commit 7e1155e into PaddlePaddle:develop Mar 24, 2022

ZzSean approved these changes Mar 24, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add is_mean param for mean op #40757

Add is_mean param for mean op #40757

AnnaTrainingG commented Mar 21, 2022 •

edited

Loading

paddle-bot-old bot commented Mar 21, 2022

ZzSean Mar 21, 2022

AnnaTrainingG Mar 21, 2022

ZzSean Mar 21, 2022

AnnaTrainingG Mar 21, 2022

ZzSean Mar 21, 2022 •

edited

Loading

AnnaTrainingG Mar 21, 2022 •

edited

Loading

zkh2016 Mar 24, 2022

AnnaTrainingG Mar 24, 2022

ZzSean left a comment

Add is_mean param for mean op #40757

Add is_mean param for mean op #40757

Conversation

AnnaTrainingG commented Mar 21, 2022 • edited Loading

PR types

PR changes

Describe

paddle-bot-old bot commented Mar 21, 2022

ZzSean Mar 21, 2022

Choose a reason for hiding this comment

AnnaTrainingG Mar 21, 2022

Choose a reason for hiding this comment

ZzSean Mar 21, 2022

Choose a reason for hiding this comment

AnnaTrainingG Mar 21, 2022

Choose a reason for hiding this comment

ZzSean Mar 21, 2022 • edited Loading

Choose a reason for hiding this comment

AnnaTrainingG Mar 21, 2022 • edited Loading

Choose a reason for hiding this comment

zkh2016 Mar 24, 2022

Choose a reason for hiding this comment

AnnaTrainingG Mar 24, 2022

Choose a reason for hiding this comment

ZzSean left a comment

Choose a reason for hiding this comment

AnnaTrainingG commented Mar 21, 2022 •

edited

Loading

ZzSean Mar 21, 2022 •

edited

Loading

AnnaTrainingG Mar 21, 2022 •

edited

Loading