-
Notifications
You must be signed in to change notification settings - Fork 772
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SB3 HER Example #188
Comments
I will do that (and probably will have to re-do it soon as the refactored version is almost ready: DLR-RM/stable-baselines3#351). It should work well with SAC too (with the new hyperparams), but best results are with TQC. |
@araffin super stuff. Waiting for the merge. I've done some baseline experiments on HER with SB3 and Parking and Summon. I will be trying at somepoint to use SB3 benchmark for HER for all 4 algorithms (SAC,TD3,TQC & DDPG) for parking-v0 but if you have the params already done can you share them with me so I can re-run them soon? |
all hyperparameters are in the rl zoo: https://github.com/DLR-RM/rl-baselines3-zoo |
Yes thank you. I've been using your benchmark tool. It's pretty solid but haven't checked out from repo or updated it in several months. |
Hello,
As SB2 is now deprecated, would you be interested in an updated version of the notebook that uses SB3?
I also found better hyper-parameters (the agent reaches 90% training success in 20 000 steps only) using TQC from the contrib repo.
The text was updated successfully, but these errors were encountered: