We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hi, @williamd4112
Thanks for your great work HEPO! I want to know the command for implementing the baseline EIPO because I noticed self.use_switch in the project:)
self.use_switch
Thanks for any advice! Best, Charlie
The text was updated successfully, but these errors were encountered:
Hi Charlie,
We will add EIPO to the current codebase soon. Please let me know if you have other questions and feel free to remind me at zwhong@mit.edu
Sorry, something went wrong.
@williamd4112 Thanks for your reply!
When I trained my task with HEPO, I noticed a performance collapse in HEPO's performance, not as good as ref policy, much less heuristics.
I don't know if you have encountered this when tuning your hyperparameters.
Can you offer any advice on hyperparameter tuning or anything else?
Best, Charlie
No branches or pull requests
Hi, @williamd4112
Thanks for your great work HEPO!
I want to know the command for implementing the baseline EIPO because I noticed
self.use_switch
in the project:)Thanks for any advice!
Best,
Charlie
The text was updated successfully, but these errors were encountered: