-
Notifications
You must be signed in to change notification settings - Fork 1.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
implement TD3+BC for offline RL #660
Conversation
Codecov Report
@@ Coverage Diff @@
## master #660 +/- ##
==========================================
+ Coverage 93.63% 93.66% +0.03%
==========================================
Files 71 72 +1
Lines 4757 4786 +29
==========================================
+ Hits 4454 4483 +29
Misses 303 303
Flags with carried forward coverage won't be shown. Click here to find out more.
📣 Codecov can now indicate which changes are the most critical in Pull Requests. Learn more |
I tried to turn on/off with
Also note that the policy converges quickly so the default epoch of 200 can be reduced to 100. |
Why not add the result in |
- implement TD3+BC for offline RL; - fix a bug in trainer about test reward not logged because self.env_step is not set for offline setting;
make format
(required)make commit-checks
(required)self.env_step
is not set for offline setting;