Remove FloatReward #829
LGTM.
The failing tests appear to be caused by Ray's incompatibility with Pydantic 2.5+. According to this issue, Ray 2.9, which is due for release at the end of December, should fix the error. For now we can add pydantic<2 to the dependencies as a temporary fix.
86c00fe to 80cb9a0
Codecov Report
All modified and coverable lines are covered by tests ✅
Additional details and impacted files
@@            Coverage Diff             @@
##           master     #829      +/-   ##
==========================================
- Coverage   95.69%   95.69%   -0.01%
==========================================
  Files         102      102
  Lines        9650     9642       -8
==========================================
- Hits         9235     9227       -8
  Misses        415      415
☔ View full report in Codecov by Sentry.
7e04341 to a55ff9e
I had to remove some outdated test code. Would you re-review, @taufeeque9?
The PyTorch warning check in
Hmm, the two changes in #332 make no sense to me. As I understand it,
IIUC the test is checking that the warning does not occur:
    assert not (
        warning.category == UserWarning
        and "NumPy array is not writeable" in warning.message.args[0]
    )
so I think we can keep that test code in. I only looked at this quickly and am quite possibly missing something obvious, so please do LMK if so!
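For readers following the thread, here is a minimal sketch of how a "warning does not occur" check like the one quoted above can be written with the standard library alone. It is not the project's actual test: `load_batch` and `has_not_writeable_warning` are hypothetical names, and the warning is raised by hand here, whereas in the real test it would come from PyTorch.

```python
import warnings


def load_batch(emit_warning: bool = False) -> list:
    """Hypothetical stand-in for the code under test."""
    if emit_warning:
        # Simulates the PyTorch warning; the exact wording is an assumption.
        warnings.warn("The given NumPy array is not writeable", UserWarning)
    return [0.0, 1.0]


def has_not_writeable_warning(emit_warning: bool) -> bool:
    """Return True if the 'not writeable' UserWarning was recorded."""
    with warnings.catch_warnings(record=True) as recorded:
        # Record every warning, even ones already shown earlier in the process.
        warnings.simplefilter("always")
        load_batch(emit_warning=emit_warning)
    return any(
        w.category == UserWarning
        and "NumPy array is not writeable" in w.message.args[0]
        for w in recorded
    )
```

A test in this style would then assert `not has_not_writeable_warning(False)`, which passes only while the warning stays absent.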
b74ad0f to a55ff9e
A quick fix for #794.
After the newest release of SB3 we don't need the FloatReward wrapper any more.
Fixes #794