【PaddlePaddle Hackathon 2】8、为 Paddle 新增 nanmean API #40472

Li-fAngyU · 2022-03-11T15:10:43Z

PR types

New features

PR changes

APIs

Describe

ISSUE链接:#40327
RFC的PR链接:PaddlePaddle/community#48
中文文档PR链接:PaddlePaddle/docs#4294
为 Paddle 新增 nanmean API

paddle.nanmean 扩展了 paddle.mean API 的功能，如果输入Tensor中有nan值， paddle.mean在计算中会将涉及nan值的结果都置为nan，而 paddle.nanmean 会跳过nan值。比如输入数据 x = [[nan, 1. , 2. ], [3. , 4. , 5. ]]，x.mean() 得到 [nan]，x.mean(0) 得到 [nan, 2.5, 3.5]，x.nanmean() 得到 [3.]，x.nanmean(0) 得到 [3., 2.5, 3.5]。此API需支持的调用路径为：paddle.nanmean 和 Tensor.nanmean 。

paddle-bot-old · 2022-03-11T15:11:07Z

Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

Update the nanmean example code

paddle-bot-old · 2022-03-18T05:16:30Z

PR格式检查通过，你的PR将接受Paddle专家以及开源社区的review，请及时关注PR动态。
The format inspection passed. Your PR will be reviewed by experts of Paddle and developers from the open-source community. Stay tuned.

jeff41404 · 2022-03-18T05:37:53Z

python/paddle/tensor/math.py

@@ -967,6 +967,81 @@ def nansum(x, axis=None, dtype=None, keepdim=False, name=None):
    return sum(tmp_tensor, axis, dtype, keepdim, name)


+def nanmean(x,axis=None,keepdim=None,name=None):


keepdim=False is better？keep same with other API？

thank you for your suggestion! keep same with other API is the better way. i will update this

jeff41404 · 2022-03-18T05:47:32Z

python/paddle/tensor/math.py

+        axis = [axis]
+    if axis == None:
+        return paddle.mean(x[~paddle.isnan(x)], keepdim=keepdim,name=name)
+    check_variable_and_dtype(x, 'x/input',


just 'x' instead of 'x/input'?

this is refer to the code of paddle.mean.
'x/input' seems to be used only as a input name when raise a type/dtype error.

jeff41404 · 2022-03-18T05:54:36Z

python/paddle/tensor/math.py

+    if axis == None:
+        return paddle.mean(x[~paddle.isnan(x)], keepdim=keepdim,name=name)
+    check_variable_and_dtype(x, 'x/input',
+                             ['uint16', 'float16', 'float32', 'float64'],


dtype should be the same to description of x above. eg: 'uint16' is not in "x (Tensor): The input Tensor with data type float32, float64."

This is refer to the code of paddle.mean
Although it require the input tensor must with data type of float32/64, but it also check allow data type of 'uint16' and 'float16' code is here.
Because of the issus describe here(paddle.nanmean extends the functionality of the paddle.mean API), so i just follow the code of paddle.mean

jeff41404 · 2022-03-18T06:17:29Z

python/paddle/tensor/math.py

+    if axis == None:
+        return paddle.mean(x[~paddle.isnan(x)], keepdim=keepdim,name=name)


can the logic of code L1041~L1042 below cover this branch? we need to handle the condition 'axis == None' alone?

thank you for your suggestion!
the logic of code L1041~L1042 below can cover this branch.
we don't need handle this condition alone.
i will take this advise and update this problem.

jeff41404 · 2022-03-18T06:29:17Z

python/paddle/fluid/tests/unittests/test_nanmean_api.py

+
+    def setUp(self):
+        self.x_shape = [2, 3, 4, 5]
+        self.x = np.random.uniform(-1, 1, self.x_shape).astype(np.float32)


self.x does not have 'nan', we should cover all the test case in rfcs
also should include check gradient

thanks for your suggestion! i will cover all the test case in next update.
but i am confused in check gradient, because i can't find the example of check gradient in the paddle.mean test file test_mean_op.py .
I will appreciate it if you can give me a example.

Ligoml · 2022-03-18T08:17:03Z

hi~有些细节需要注意一下 @Li-fAngyU
1、PR的标题要和ISSUE标题内容一样；
2、Describe 里要加上 ISSUE 链接 & RFC链接 & 中文文档链接

Li-fAngyU · 2022-04-02T05:01:26Z

@Ligoml Done!

修改了nanmean的axis参数的文档描述。

Ligoml

LGTM for docs

paddle-bot-old · 2022-04-02T06:20:24Z

祝贺你，你的PR测试通过，后续将会纳入飞桨的发版计划中，感谢你对飞桨开发者社区的参与。
Congratulations! The test passed and your PR will be released in our coming stable version. Thank you for your contribution to the open-source project of PaddlePaddle.

paddle-bot-old · 2022-04-02T06:23:25Z

你的PR有最新反馈，请及时修改。
There’s the latest feedback about your PR. Please check.

updata nanmean's sample code (:name: code-example1)

XiaoguangHu01

LGTM

修改nanmean的example code 错误

update example code

update example code of nanmean

Ligoml

LGTM for docs

XiaoguangHu01

LGTM

Li-fAngyU added 10 commits March 11, 2022 12:18

Update __init__.py

265e64c

Update math.py

50960a6

Create test_nanmean_api.py

9acc480

Update __init__.py

86d44ab

Update __init__.py

e238284

Update math.py

2ea0682

Update test_nanmean_api.py

cc70998

Update __init__.py

f3da96d

Update math.py

b8e03d0

Update test_nanmean_api.py

bfd0fff

Li-fAngyU mentioned this pull request Mar 11, 2022

【PaddlePaddle Hackathon 第二期】任务总览 #40234

Closed

Li-fAngyU added 7 commits March 12, 2022 09:47

Update test_nanmean_api.py

a2de8d0

Update test_nanmean_api.py

8efba3c

Update math.py

111ee88

Update test_nanmean_api.py

037d038

Update math.py

b0afcd4

Update the nanmean example code

Update math.py

777b29e

Update math.py

e3e78fb

Ligoml added the contributor External developers label Mar 18, 2022

Ligoml assigned jeff41404 Mar 18, 2022

Ligoml added the status: open review label Mar 18, 2022

jeff41404 reviewed Mar 18, 2022

View reviewed changes

Li-fAngyU changed the title ~~【hackathon + no.8】~~ 【PaddlePaddle Hackathon 2】8、为 Paddle 新增 nanmean API Mar 18, 2022

Update math.py

9f29a18

修改了nanmean的axis参数的文档描述。

Ligoml previously approved these changes Apr 2, 2022

View reviewed changes

DDDivano added the status: finished label Apr 2, 2022

paddle-bot-old bot removed the status: revision label Apr 2, 2022

DDDivano added status: revision and removed status: finished labels Apr 2, 2022

Li-fAngyU added 2 commits April 2, 2022 16:29

Merge branch 'PaddlePaddle:develop' into Li-fAngyU-patch-1

c66454a

Update math.py

b03ec25

updata nanmean's sample code (:name: code-example1)

Li-fAngyU dismissed Ligoml’s stale review via b03ec25 April 2, 2022 08:32

XiaoguangHu01 previously approved these changes Apr 2, 2022

View reviewed changes

Li-fAngyU added 2 commits April 3, 2022 13:13

Merge branch 'PaddlePaddle:develop' into Li-fAngyU-patch-1

7be3764

Update math.py

32179ff

修改nanmean的example code 错误

Li-fAngyU dismissed XiaoguangHu01’s stale review via 32179ff April 3, 2022 05:14

Ligoml previously approved these changes Apr 3, 2022

View reviewed changes

Merge branch 'PaddlePaddle:develop' into Li-fAngyU-patch-1

b029831

Li-fAngyU dismissed Ligoml’s stale review via b018a5d April 3, 2022 11:58

Li-fAngyU added 2 commits April 3, 2022 19:58

Update math.py

b018a5d

update example code

Update math.py

1391212

update example code of nanmean

Ligoml approved these changes Apr 4, 2022

View reviewed changes

XiaoguangHu01 approved these changes Apr 5, 2022

View reviewed changes

jeff41404 merged commit 1d43e2d into PaddlePaddle:develop Apr 6, 2022

Li-fAngyU deleted the Li-fAngyU-patch-1 branch October 19, 2022 07:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

【PaddlePaddle Hackathon 2】8、为 Paddle 新增 nanmean API #40472

【PaddlePaddle Hackathon 2】8、为 Paddle 新增 nanmean API #40472

Li-fAngyU commented Mar 11, 2022 •

edited by luotao1

Loading

paddle-bot-old bot commented Mar 11, 2022

paddle-bot-old bot commented Mar 18, 2022

jeff41404 Mar 18, 2022

Li-fAngyU Mar 18, 2022

jeff41404 Mar 18, 2022

Li-fAngyU Mar 18, 2022

jeff41404 Mar 18, 2022

Li-fAngyU Mar 18, 2022 •

edited

Loading

jeff41404 Mar 18, 2022

Li-fAngyU Mar 18, 2022

jeff41404 Mar 18, 2022

Li-fAngyU Mar 18, 2022

Ligoml commented Mar 18, 2022

Li-fAngyU commented Apr 2, 2022

Ligoml left a comment

paddle-bot-old bot commented Apr 2, 2022

paddle-bot-old bot commented Apr 2, 2022

XiaoguangHu01 left a comment

Ligoml left a comment

XiaoguangHu01 left a comment

		@@ -967,6 +967,81 @@ def nansum(x, axis=None, dtype=None, keepdim=False, name=None):
		return sum(tmp_tensor, axis, dtype, keepdim, name)


		def nanmean(x,axis=None,keepdim=None,name=None):

		if axis == None:
		return paddle.mean(x[~paddle.isnan(x)], keepdim=keepdim,name=name)

【PaddlePaddle Hackathon 2】8、为 Paddle 新增 nanmean API #40472

【PaddlePaddle Hackathon 2】8、为 Paddle 新增 nanmean API #40472

Conversation

Li-fAngyU commented Mar 11, 2022 • edited by luotao1 Loading

PR types

PR changes

Describe

paddle-bot-old bot commented Mar 11, 2022

paddle-bot-old bot commented Mar 18, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Li-fAngyU Mar 18, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Ligoml commented Mar 18, 2022

Li-fAngyU commented Apr 2, 2022

Ligoml left a comment

Choose a reason for hiding this comment

paddle-bot-old bot commented Apr 2, 2022

paddle-bot-old bot commented Apr 2, 2022

XiaoguangHu01 left a comment

Choose a reason for hiding this comment

Ligoml left a comment

Choose a reason for hiding this comment

XiaoguangHu01 left a comment

Choose a reason for hiding this comment

Li-fAngyU commented Mar 11, 2022 •

edited by luotao1

Loading

Li-fAngyU Mar 18, 2022 •

edited

Loading