-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
refactor A2C/PPO, change behavior of value normalization #321
Conversation
related to #307 |
Codecov Report
@@ Coverage Diff @@
## master #321 +/- ##
==========================================
- Coverage 93.97% 93.95% -0.03%
==========================================
Files 51 51
Lines 3269 3274 +5
==========================================
+ Hits 3072 3076 +4
- Misses 197 198 +1
Flags with carried forward coverage won't be shown. Click here to find out more.
Continue to review full report at Codecov.
|
Refactor A2C/PPO to make them look consistent. Change behavior of value normalization, etc to be prepared for A2C benchmark.