[Feature Request] Custom environment tutorial #919
Comments
Thanks for this! Wonderful! Do you think we should make a tutorial out of it, or an example? Can you have a look at this: It's a tutorial to code the pendulum, sort of the simplest thing I could think of.
I guess an example is shorter than a tutorial, so this could go along with the custom environment tutorial: since you mention stateless and stateful environments, this could serve as an example of a stateful one. A short, to-the-point example may also benefit users who just want answers quickly. I'd be happy to make any adjustments in that regard. As for the pendulum tutorial, I think it's going to be good; I like having everything explained in detail. I'm just reading it as code in the .py file, though. Do you have a link to the .html? Right now it's hard to get a feel for how easy it is to read.
Yep, it needs some polishing, but here it is
Comments:
The rest is simple and understandable :)
One year and this is not a thing yet? :D Just copy-paste that notebook into the official repo. How can that hurt?
We do have a tutorial now. Have you had a chance to check it out? https://pytorch.org/rl/tutorials/pendulum.html
Motivation
Hey, I had a rough time digging through the source code to figure out how to make a custom environment using EnvBase, so I've written an early draft of a tutorial based on my experience :). It's a simple control problem with a linear system. I'm not sure whether this is the right way to submit it or whether I should open a pull request instead, so please let me know.
Solution
Here's the Jupyter notebook. I think it's relatively short and understandable, and I'd appreciate feedback, i.e. whether it's actually implemented the way it was intended to be. (I've managed to train PPO on it, so it works, but it might not follow best practices.)
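For readers who cannot open the notebook, here is a rough sketch of what a linear-system control problem can look like, and of the stateful versus stateless distinction raised in the comments. It is plain Python and deliberately does not use TorchRL's EnvBase API, so it is not the notebook's code. The dynamics `x_next = a * x + b * u`, the quadratic cost, and every name and default value below are assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class LinearSystemEnv:
    """Stateful toy environment (hypothetical): x_next = a * x + b * u."""

    a: float = 0.9        # assumed state-transition coefficient
    b: float = 0.5        # assumed control gain
    x0: float = 1.0       # assumed initial state
    max_steps: int = 50   # assumed episode length
    x: float = 0.0        # current state, stored on the object (stateful)
    t: int = 0            # step counter

    def reset(self) -> float:
        """Restore the initial state and return it as the first observation."""
        self.x = self.x0
        self.t = 0
        return self.x

    def step(self, u: float):
        """Apply control u; return (observation, reward, done)."""
        self.x = self.a * self.x + self.b * u
        self.t += 1
        # Assumed quadratic cost: penalize distance from 0 and control effort.
        reward = -(self.x ** 2) - 0.1 * (u ** 2)
        done = self.t >= self.max_steps
        return self.x, reward, done


def stateless_step(x: float, u: float, a: float = 0.9, b: float = 0.5):
    """Stateless variant: the caller passes the state in and gets the next one back."""
    next_x = a * x + b * u
    reward = -(next_x ** 2) - 0.1 * (u ** 2)
    return next_x, reward


if __name__ == "__main__":
    env = LinearSystemEnv()
    x = env.reset()
    done = False
    total_reward = 0.0
    while not done:
        u = -x  # simple proportional controller, only to exercise the loop
        x, reward, done = env.step(u)
        total_reward += reward
    print(f"final state: {x:.6f}, return: {total_reward:.4f}")
```

In the stateful class the environment owns `x` between calls, while `stateless_step` leaves the bookkeeping to the caller. A TorchRL environment would additionally need to declare its specs and exchange data as tensordicts, which this sketch leaves out.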
Alternatives
/
Additional context
/
Checklist