PPO

An implementation of Proximal Policy Optimization (PPO) using Generalized Advantage Estimation (GAE) and multi-processing.
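For readers new to GAE, here is a minimal pure-Python sketch of the advantage computation over a single rollout. It is not taken from this repository's code: the function name `compute_gae`, its arguments, and the default `gamma` and `lam` values are assumptions chosen for illustration.

```python
def compute_gae(rewards, values, dones, last_value, gamma=0.99, lam=0.95):
    """Hypothetical helper: return (advantages, returns) for one rollout.

    rewards, values, dones are equal-length lists for steps 0..T-1;
    last_value is the value estimate for the state after the final step.
    """
    advantages = [0.0] * len(rewards)
    gae = 0.0
    next_value = last_value
    # Walk the rollout backwards, accumulating discounted TD errors.
    for t in reversed(range(len(rewards))):
        # Do not bootstrap across an episode boundary.
        not_done = 0.0 if dones[t] else 1.0
        delta = rewards[t] + gamma * next_value * not_done - values[t]
        gae = delta + gamma * lam * not_done * gae
        advantages[t] = gae
        next_value = values[t]
    # Value-function targets are the advantages plus the baseline values.
    returns = [a + v for a, v in zip(advantages, values)]
    return advantages, returns
```

Setting `lam=0` reduces each advantage to the one-step TD error, while `lam=1` gives the discounted Monte Carlo return minus the value baseline.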

Installing

python3 -m venv venv to set up a virtual environment

From the repository root, run pip install . to install, or pip install -e . for an editable development install.

python src/run_ppo.py config/pendulum.yaml trains PPO with the given config file. Example configs for different environments, with hyperparameters I've found to work well, are in config/.

Paper Notes

See https://salmanmohammadi.github.io/content/ppo/ for an explanation of the method.
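As a quick reference, the clipped surrogate objective at the heart of PPO can be sketched for a single sample as follows. This is an illustrative sketch, not this repository's implementation: the function name `clipped_surrogate` and the default `clip_eps=0.2` are assumptions.

```python
import math


def clipped_surrogate(new_logp, old_logp, advantage, clip_eps=0.2):
    """Hypothetical helper: per-sample PPO clipped objective (to be maximized)."""
    # Probability ratio between the updated policy and the data-collecting policy.
    ratio = math.exp(new_logp - old_logp)
    # Restrict the ratio to [1 - clip_eps, 1 + clip_eps].
    clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    # Take the pessimistic (lower) of the unclipped and clipped terms.
    return min(ratio * advantage, clipped_ratio * advantage)
```

In training, the objective is averaged over a minibatch and its negative is minimized. Taking the minimum removes any incentive to push the ratio outside the clip range in the direction that would raise the objective.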
