[OSCP] 使用 SPU 实现 AP(average_precision_score) 函数 #801

z0gSh1u · 2024-08-04T08:55:06Z

Pull Request

What problem does this PR solve?

Issue Number: Fixed #727

Implemented average_precision_score function for binary classification and multi-class classification with three average methods.

deadlywing

另外，，需要在classification_emul.py中增加ap的调用测试，具体方式可以参考里面的其他例子

sml/metrics/classification/BUILD.bazel

sml/metrics/classification/classification.py

sml/metrics/classification/classification_test.py

sml/metrics/classification/classification.py

deadlywing · 2024-08-13T09:11:54Z

TODO @z0gSh1u ：

修改/测试有 tied value的情况
增加emulation测试

z0gSh1u · 2024-08-13T11:27:15Z

TODO @z0gSh1u ：

修改/测试有 tied value的情况

增加emulation测试

👌🏻 I'll ping you when I'm ready.

z0gSh1u · 2024-08-15T02:01:25Z

TODO @z0gSh1u ：

修改/测试有 tied value的情况

增加emulation测试

已修改，可以再次评审~

sml/metrics/classification/classification.py

sml/metrics/classification/classification_test.py

deadlywing · 2024-08-15T02:27:33Z

计算逻辑还是有一点问题，，
另外，发现test的tol有点太高了，可能导致你没发现问题，，建议把atol和rtol调整到1e-3

deadlywing · 2024-08-15T02:56:33Z

提供一个比较直接的思路：

这里的难点其实是tied value会导致precision计算的末尾会出现nan（或者极大的值），recall的末尾出现0值（这使得AP计算公式出错）
利用threshold>0其实可以得到tiled value有哪些的mask，则precision可以转化为[x,x,x,x,...,0,0,..] ，这就和recall的格式一致了，均为[y,y,y,y,...,0,0,..]
AP本质就是计算积分，数值上就是 diff(recall) * precision，所以我们可以直接计算diff(recall) 这个array，但是需要注意实际需要的diff(recall) = [y1-y0, y2-y1, ..., yn-y_(n-1), 0,0,0] (后面还是会有0，这个可以通过rotate加一次mux完成)
同理，precision也需要根据mask得到[x0,x1,x2,..,0,0.]（这里的逻辑可能会复杂一些，需要一些翻转之类的操作），同样，也会有尾0

anyway，核心还是围绕AP的计算公式，只不过要小心处理尾0，使得整个计算结果保持和明文一致

z0gSh1u · 2024-08-16T01:52:53Z

提供一个比较直接的思路：

这里的难点其实是tied value会导致precision计算的末尾会出现nan（或者极大的值），recall的末尾出现0值（这使得AP计算公式出错）

利用threshold>0其实可以得到tiled value有哪些的mask，则precision可以转化为[x,x,x,x,...,0,0,..] ，这就和recall的格式一致了，均为[y,y,y,y,...,0,0,..]

AP本质就是计算积分，数值上就是 diff(recall) * precision，所以我们可以直接计算diff(recall) 这个array，但是需要注意实际需要的diff(recall) = [y1-y0, y2-y1, ..., yn-y_(n-1), 0,0,0] (后面还是会有0，这个可以通过rotate加一次mux完成)

同理，precision也需要根据mask得到[x0,x1,x2,..,0,0.]（这里的逻辑可能会复杂一些，需要一些翻转之类的操作），同样，也会有尾0

anyway，核心还是围绕AP的计算公式，只不过要小心处理尾0，使得整个计算结果保持和明文一致

调整完成，使用thresholds > 0作为mask调整了precision的计算区间，并改用了更严格的tol检查。由于y_score存在0值时thresholds > 0条件不准确（trailing 0 和 0 score无法区分），因此还添加了下界score_eps。

deadlywing · 2024-08-16T04:20:01Z

hello，，正确性看上去应该没问题了，但是性能上仍然有巨大的优化空间

我在本地跑了个10000个样本测试，，通信

Link details: total send bytes 25116300315, recv bytes 25137340315, send actions 720907, recv actions 721170

可以发现这个cost还是很离谱，，主要的原因在于

sorted_pairs = pairs[jnp.argsort(pairs[:, 1], descending=True, stable=True)]

这里会调用secret index，这个在MPC下是无比昂贵的。。
可以替换为：

sorted_pairs = create_sorted_label_score_pair(y_true, y_score)

同理，recall的计算也可替换成

max_tp = jnp.max(tp)
recalls = jnp.where(max_tp == 0, jnp.ones_like(tp), tp / max_tp)

替换后，可以发现通信量缩减在250x以上，耗时也减少100x左右。。

sml/metrics/classification/classification.py

sml/metrics/classification/classification_emul.py

deadlywing · 2024-08-16T04:25:10Z

另外，，我发现emulation的文件里其他两个函数在run之前也没加seal，，这也是有问题的，，麻烦你也顺便都加上吧🙏

deadlywing · 2024-08-19T02:22:08Z

@z0gSh1u
hello，，代码已经ok了，但是我看bazel的format checker有问题，，我本地run了一下buildfier，似乎是这俩dep的顺序，，麻烦本地修改一下，，（最好本地再用buildifier check一下再push哈～）

z0gSh1u · 2024-08-19T02:54:21Z

@z0gSh1u hello，，代码已经ok了，但是我看bazel的format checker有问题，，我本地run了一下buildfier，似乎是这俩dep的顺序，，麻烦本地修改一下，，（最好本地再用buildifier check一下再push哈～）

好的，我晚些检查&修改下，顺便合一下master

deadlywing

LGTM

[OSCP] 使用 SPU 实现 AP(average_precision_score) 函数

87df961

Candicepan requested a review from deadlywing August 6, 2024 06:44

deadlywing reviewed Aug 6, 2024

View reviewed changes

[OSCP] 使用 SPU 实现 AP(average_precision_score) 函数 - 调整实现逻辑

57e6a6e

deadlywing reviewed Aug 13, 2024

View reviewed changes

sml/metrics/classification/classification.py Outdated Show resolved Hide resolved

[OSCP] 使用 SPU 实现 AP 函数 - 考虑tied value，新增emulation

c1f8e9a

deadlywing reviewed Aug 15, 2024

View reviewed changes

sml/metrics/classification/classification.py Outdated Show resolved Hide resolved

sml/metrics/classification/classification_test.py Outdated Show resolved Hide resolved

z0gSh1u force-pushed the main branch from 6095b83 to 8ce3dba Compare August 16, 2024 00:48

[OSCP] 使用 SPU 实现 AP 函数 - 调整实现逻辑并使用更严格的测试

d41fefa

z0gSh1u force-pushed the main branch from 8ce3dba to d41fefa Compare August 16, 2024 00:49

[OSCP] 使用 SPU 实现 AP 函数 - 微调

a382aba

deadlywing reviewed Aug 16, 2024

View reviewed changes

sml/metrics/classification/classification.py Outdated Show resolved Hide resolved

sml/metrics/classification/classification_emul.py Outdated Show resolved Hide resolved

[OSCP] 使用 SPU 实现 AP 函数 - 降低计算代价，emulate之前seal

b404f34

z0gSh1u added 2 commits August 20, 2024 00:06

[OSCP] 使用 SPU 实现 AP 函数 - 调整BUILD.bazel

be6185b

Merge branch 'main' of github.com:secretflow/spu

ba03934

deadlywing approved these changes Aug 20, 2024

View reviewed changes

deadlywing merged commit 28fef7d into secretflow:main Aug 20, 2024
8 of 9 checks passed

github-actions bot locked and limited conversation to collaborators Aug 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[OSCP] 使用 SPU 实现 AP(average_precision_score) 函数 #801

[OSCP] 使用 SPU 实现 AP(average_precision_score) 函数 #801

z0gSh1u commented Aug 4, 2024

deadlywing left a comment

deadlywing commented Aug 13, 2024

z0gSh1u commented Aug 13, 2024

z0gSh1u commented Aug 15, 2024

deadlywing commented Aug 15, 2024

deadlywing commented Aug 15, 2024

z0gSh1u commented Aug 16, 2024

deadlywing commented Aug 16, 2024

deadlywing commented Aug 16, 2024

deadlywing commented Aug 19, 2024

z0gSh1u commented Aug 19, 2024

deadlywing left a comment

[OSCP] 使用 SPU 实现 AP(average_precision_score) 函数 #801

[OSCP] 使用 SPU 实现 AP(average_precision_score) 函数 #801

Conversation

z0gSh1u commented Aug 4, 2024

Pull Request

What problem does this PR solve?

deadlywing left a comment

Choose a reason for hiding this comment

deadlywing commented Aug 13, 2024

z0gSh1u commented Aug 13, 2024

z0gSh1u commented Aug 15, 2024

deadlywing commented Aug 15, 2024

deadlywing commented Aug 15, 2024

z0gSh1u commented Aug 16, 2024

deadlywing commented Aug 16, 2024

deadlywing commented Aug 16, 2024

deadlywing commented Aug 19, 2024

z0gSh1u commented Aug 19, 2024

deadlywing left a comment

Choose a reason for hiding this comment