SB3 HER Example #188

araffin · 2021-04-30T12:59:10Z

Hello,
As SB2 is now deprecated, would you be interested in an updated version of the notebook that uses SB3?

I also found better hyper-parameters (the agent reaches 90% training success in 20 000 steps only) using TQC from the contrib repo.

eleurent · 2021-04-30T14:05:57Z

Hi @araffin, yes I would, thank you very much!

I already started switching to SB3 in a few scripts, but let's make it official :)

PS: wow I really need to check out that TQC algorithm!

araffin · 2021-04-30T14:20:37Z

I will do that (and probably will have to re-do it soon as the refactored version is almost ready: DLR-RM/stable-baselines3#351).

It should work well with SAC too (with the new hyperparams), but best results are with TQC.

StarBaseOne · 2021-05-01T13:59:33Z

@araffin super stuff. Waiting for the merge.
@eleurent yep TQC works nicely (great implementation from SB3) and outperforms SAC.

I've done some baseline experiments on HER with SB3 and Parking and Summon. I will be trying at somepoint to use SB3 benchmark for HER for all 4 algorithms (SAC,TD3,TQC & DDPG) for parking-v0 but if you have the params already done can you share them with me so I can re-run them soon?

araffin · 2021-05-01T15:16:27Z

all hyperparameters are in the rl zoo: https://github.com/DLR-RM/rl-baselines3-zoo
(and i would recommend using it for doing experiments)

StarBaseOne · 2021-05-01T15:19:20Z

all hyperparameters are in the rl zoo: https://github.com/DLR-RM/rl-baselines3-zoo
(and i would recommend using it for doing experiments)

Yes thank you. I've been using your benchmark tool. It's pretty solid but haven't checked out from repo or updated it in several months.

araffin mentioned this issue May 3, 2021

Upgrade HER example from SB2 to SB3 #189

Merged

eleurent closed this as completed in #189 May 3, 2021

araffin mentioned this issue Jul 2, 2021

Fixes for SB3>v1.1.0 #208

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SB3 HER Example #188

SB3 HER Example #188

araffin commented Apr 30, 2021

eleurent commented Apr 30, 2021 •

edited

Loading

araffin commented Apr 30, 2021

StarBaseOne commented May 1, 2021 •

edited

Loading

araffin commented May 1, 2021

StarBaseOne commented May 1, 2021 •

edited

Loading

SB3 HER Example #188

SB3 HER Example #188

Comments

araffin commented Apr 30, 2021

eleurent commented Apr 30, 2021 • edited Loading

araffin commented Apr 30, 2021

StarBaseOne commented May 1, 2021 • edited Loading

araffin commented May 1, 2021

StarBaseOne commented May 1, 2021 • edited Loading

eleurent commented Apr 30, 2021 •

edited

Loading

StarBaseOne commented May 1, 2021 •

edited

Loading

StarBaseOne commented May 1, 2021 •

edited

Loading