-
Notifications
You must be signed in to change notification settings - Fork 4.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
multi-gpu ppo #2288
multi-gpu ppo #2288
Conversation
…gies/ml-agents into develop-multi-gpu
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Had some in person and in-PR feedback. Let's also chat a bit about testing this.
…gies/ml-agents into develop-multi-gpu
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is looking a lot better to my eyes. I think you'll want to at least make sure we get some unit test coverage on the core pieces of the MultiGpuPPOPolicy code-- update
, average_gradients
, and get_devices
. It should now be easier to mock out construct_feed_dict
so that you can be sure your update
function behaves as you'd expect.
add multi-gpu ppo training