-
Notifications
You must be signed in to change notification settings - Fork 5.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add save checkpoint on pserver. #10376
Comments
The goal of the checkpoint is: add save/restore checkpoint to PServer and add restore variables/connections to Trainer to realize fault tolerant.
|
背景:
M1 阶段设计方案: |
另外还有一个细节:pserver load checkpoint的时候,需要能知道自己需要load的一部分数据。并通过attr传给restore_op |
OP中的 |
您好,此issue在近一个月内暂无更新,我们将于今天内关闭。若在关闭后您仍需跟进提问,可重新开启此问题,我们将在24小时内回复您。因关闭带来的不便我们深表歉意,请您谅解~感谢您对PaddlePaddle的支持! |
No description provided.
The text was updated successfully, but these errors were encountered: