@lshpku lshpku commented Jul 29, 2025

PR Category

Communication Library

PR Types

Performance

Description

Updates the underlying intranode & internode kernels to the upstream commit deepseek-ai/DeepEP@079c5a4 (July 14).
That commit already includes the TMA optimization for internode performance.

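Changes in this PR

intranode.cu, internode.cu, configs.cuh, ibgda_device.cuh: copied over directly.

launch.cuh, utils.cuh: copied over, but the deprecated functions that low_latency still depends on are kept (low_latency is maintained by the inference team and is left unmodified).

runtime.cu, layout.cu: copied over and merged into a single runtime.cu (they were merged the same way before).

api.cuh: the intranode & internode parts are copied over.

deep_ep.hpp: minor changes to the member variables of Buffer.

deep_ep.cpp: the Buffer constructor, the sync method, and the call sites for intranode & internode are modified to set the newly added member variables correctly and to match the new CUDA-layer interface.

types.h: one helper method added.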

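Correctness tests

Unit tests were run with test_intranode.py and test_internode.py (2, 4, and 8 machines); all passed.

End-to-end convergence tests were run with DeepseekV3 under several PP and EP configurations; all passed.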

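Performance changes

The advantage of the new version is that it reaches the same communication bandwidth with fewer SMs, leaving more SMs for computation.

For example, on DeepseekV3: deepep SMs 20 -> 14, deepgemm SMs 112 -> 118, for an end-to-end improvement of 1-2%.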
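
The SM trade-off above can be checked with a few lines of arithmetic. This is only an illustrative sketch: the total of 132 SMs is an assumption inferred from 20 + 112 = 14 + 118, and `compute_sms` is a hypothetical helper, not part of Paddle or DeepEP.

```python
# Hypothetical helper: SMs left for compute once communication takes its share.
# TOTAL_SMS = 132 is an assumption inferred from the numbers quoted above.
TOTAL_SMS = 132


def compute_sms(total_sms: int, comm_sms: int) -> int:
    """Return the number of SMs available to compute kernels."""
    if not 0 <= comm_sms <= total_sms:
        raise ValueError("comm_sms must be between 0 and total_sms")
    return total_sms - comm_sms


old = compute_sms(TOTAL_SMS, 20)  # SMs for deepgemm before this PR
new = compute_sms(TOTAL_SMS, 14)  # SMs for deepgemm after this PR
gain = (new - old) / old          # relative increase in compute SMs

print(f"compute SMs: {old} -> {new} ({gain:.1%} more)")
```

The relative increase in compute SMs is larger than the 1-2% end-to-end figure, which would be expected if only part of each training step is spent in the affected kernels.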


Pcard-85711

@lshpku lshpku requested review from ForFishes and sneaxiy as code owners July 29, 2025 06:49

paddle-bot bot commented Jul 29, 2025

Your PR has been submitted. Thanks for your contribution to the open-source project!
Please watch for the results of the automated CI tests. See the Paddle CI Manual for details.

@lshpku lshpku force-pushed the update-deepep-079c5a4 branch 3 times, most recently from 117bd81 to 5f846fc Compare August 4, 2025 11:09
@lshpku lshpku force-pushed the update-deepep-079c5a4 branch from 5f846fc to a3d0d9e Compare August 6, 2025 09:17
@gongweibao gongweibao left a comment

LGTM

@XiaoguangHu01 XiaoguangHu01 left a comment

LGTM

@lshpku lshpku merged commit 530cd6d into PaddlePaddle:develop Aug 12, 2025
80 of 83 checks passed
maxiaolong001 pushed a commit to maxiaolong001/Paddle that referenced this pull request Aug 12, 2025
lshpku added a commit to lshpku/Paddle that referenced this pull request Oct 28, 2025
lshpku added a commit to lshpku/Paddle that referenced this pull request Oct 28, 2025
zyfncg added a commit to zyfncg/Paddle that referenced this pull request Nov 12, 2025
zyfncg added a commit to zyfncg/Paddle that referenced this pull request Nov 12, 2025