-
Notifications
You must be signed in to change notification settings - Fork 387
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feature(nyz): add PPOF new interface support #567
Conversation
# Install | ||
|
||
```shell | ||
pip install git+https://github.com/nighood/rocket-recycling@master#egg=rocket_recycling |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
是否可以把这个修改版的 rocket-recycling 移到 opendilab 下面?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这个暂时不用吧,等稳定版
import treetensor.torch as ttorch | ||
from torch.distributions import Normal, Independent | ||
|
||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
增加 Overview 注释?
deterministic_eval=True, | ||
# model | ||
model=dict(), | ||
) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
增加 overview 介绍以及 ppof 与 ppo 的区别,介绍为什么需要单独写一个 ppof.py
task.run() | ||
|
||
def deploy(self, ckpt_path: str = None, enable_save_replay: bool = False, debug: bool = False) -> None: | ||
if debug: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
deploy 与 batch_evaluate 的异同是?是否可以添加一下注释
'drone_fly', | ||
'hybrid_moving', | ||
] | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
添加 bonus 路径下的文件说明?简介这里的ppof与policy/下面的ppof的异同?
Codecov Report
@@ Coverage Diff @@
## main #567 +/- ##
==========================================
- Coverage 84.59% 83.45% -1.14%
==========================================
Files 555 562 +7
Lines 45197 45489 +292
==========================================
- Hits 38233 37964 -269
- Misses 6964 7525 +561
Flags with carried forward coverage won't be shown. Click here to find out more.
Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here. |
Description
Related Issue
TODO
Check List