Library Conversion: Open AI Baselines #25349
Comments
I believe even Google Dopamine needs an update to TF 2.0 |
Hi, I have previously worked on some projects with OpenAI Baselines. Can I give this a try? |
Adding @tanzhenyu as an assignee here because he's been taking a lead on this & on coordinating with the OpenAI team. Zhenyu, do you have any part of the migration in mind that you feel open source contributors would be able to assist with right now? |
Sorry for the delay. Yes it'd be great to get some help from external contributors. @ffahleraz would you be able to help on one or two algos? |
@tanzhenyu I'd be happy to assist as well! I could help with an algorithm or two. Would that be useful? |
@josiahbjorgaard Yep, that'd be very helpful. The plan discussed with OpenAI is to first port things to my own repo. I have already done so with PPO/A2C/DDPG, and am working on DQN right now. We could use help on 1) GAIL, 2) ACER, 3) ACKTR, 4) HER, 5) TRPO. |
@tanzhenyu I'll start on ACER |
TF-Agents is prioritized over Dopamine - @dynamicwebpaige. If anyone is interested, there's also Stable Baselines - a TensorFlow-based fork of OpenAI's Baselines: https://github.com/Stable-Baselines-Team/stable-baselines with ACER, A2C, ACKTR, HER, PPO2, SAC etc. Full blog post here. |
@tanzhenyu I will start on ACKTR, if no one is already working on it. |
Go for it @MoritzTaylor - handing the ownership over to you - you may be a lot quicker 👍 |
@MoritzTaylor Sounds good. Have fun and let me know how I could help! |
I will start working on GAIL! 🚀 |
@MoritzTaylor @josiahbjorgaard @seungjaeryanlee An update from my side: I have merged the current local repo into the baselines tf2 branch. You can sync either from there or from my repo (since I will probably continue working on hindsight experience replay). And let me know how I can help! |
@tanzhenyu I already sync from your repo to stay up to date with your commits. |
@MoritzTaylor Ah yes, KFAC is a great optimizer and unfortunately we don't include it in TF core. |
@josiahbjorgaard @seungjaeryanlee Gentle ping. Let me know if you need additional help. |
@tanzhenyu Somebody else should take over mine. I haven't had a chance to get anywhere with it. |
@josiahbjorgaard Alright. |
Yes, we would also like to support TF 2.0 in Stable Baselines in the future. Corresponding issue: hill-a/stable-baselines#366 @dynamicwebpaige Should I open a new issue for that? |
@tanzhenyu if there are any open issues that a community contributor can help with, I'd be more than happy to help! |
Sure thing! Is there a date that you need it by? @tanzhenyu |
@avnishn Probably in a month or two. I don't have much bandwidth to work on this at the moment, but ACER should be pretty straightforward compared to other algs. Let me know if you run into any issues. |
@tanzhenyu |
@MoritzTaylor Yes that makes sense to rewrite while maintaining most of the main logic. |
@tanzhenyu It seems nobody has taken TRPO. I'd be happy to help with that or any other algos. |
Here. |
@tanzhenyu it seems TRPO is rewritten. If there are any open issues that I can help with, I'd be very happy to help! |
Have you finished KFAC yet? I cloned your KFAC.py and got an error |
@araffin It seems that you don't have big plans to upgrade stable-baselines to TF 2. Although you already started another GitHub project for it, you are apparently investing more of your time in stable-baselines-3 with PyTorch. TF 2.0 is more widely used than PyTorch, so I feel this is somewhat of a mistake. |
I already replied to you in the SB issue, but for others, the answers about our choice to use PyTorch are in hill-a/stable-baselines#366 and hill-a/stable-baselines#733 |
I'm gonna close this issue, given that the OpenAI Baselines conversion is complete under branch tf-2. We don't have immediate plans to support StableBaselines.
OpenAI Baselines is a set of high-quality implementations of reinforcement learning algorithms. These algorithms make it easier for the research community to replicate, refine, and identify new ideas, and create solid baselines to build research on top of.
The purpose of this ticket will be to migrate the OpenAI Baselines library to the TF 2.0 API.
For more information on OpenAI Baselines, check out their blog post here, and be sure to amble around their website, as well as the OpenAI Gym!