PPO_implementation_v4.0

PPO (Proximal Policy Optimization) implementation for continuous action spaces, written for TensorFlow 2.8.0.
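For readers new to PPO, here is a minimal sketch of the clipped surrogate objective with a Gaussian policy, which is the usual choice for continuous actions. It is not taken from this repository's code: the function names `gaussian_log_prob` and `clipped_surrogate` and the default `clip_eps=0.2` are assumptions made for illustration, and the sketch uses plain Python rather than TensorFlow so it runs without dependencies.

```python
import math


def gaussian_log_prob(action, mean, std):
    """Log-density of a scalar action under a Normal(mean, std^2) policy."""
    var = std * std
    return (
        -((action - mean) ** 2) / (2.0 * var)
        - math.log(std)
        - 0.5 * math.log(2.0 * math.pi)
    )


def clipped_surrogate(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective (a quantity to maximize).

    clip_eps is a hypothetical default; the repository's value may differ.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio between the updated and the data-collecting policy.
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio inside [1 - clip_eps, 1 + clip_eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the pessimistic (smaller) of the two candidate terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


if __name__ == "__main__":
    # One sample whose ratio exp(0.5) exceeds 1 + clip_eps, with positive advantage.
    print(clipped_surrogate([0.5], [0.0], [1.0]))
```

With a positive advantage the objective stops rewarding ratio increases beyond `1 + clip_eps`, while with a negative advantage the unclipped term is kept, which is what discourages large policy updates.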

The algorithm was tested on the OpenAI Gym 'Pendulum-v1' environment.

This implementation is heavily inspired by @mandrakedrink's PyTorch implementation (https://github.com/mandrakedrink/PPO-pytorch).

To see the parameters you can set, run: python main.py -h

Pendulum-v1/test_pendolo
logs
utils
LICENSE
README.md
agent.py
env_wrappers.py
main.py
networks.py
ppo_memory.py
