Single agent PPO

Environment

The Reacher environment has a continuous 33-dimensional state space and a continuous 4-dimensional action space. The environment is considered solved when the agent gets an average score of at least 30 over 100 consecutive episodes.
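
The solved criterion above can be written as a small check. This is a minimal sketch, not code from PPO_v0.py; the function name `is_solved` and its parameters are hypothetical and chosen only for illustration.

```python
def is_solved(scores, target=30.0, window=100):
    """Return True if the mean of the last `window` scores reaches `target`.

    `scores` is a list with one total reward per episode, oldest first.
    """
    if len(scores) < window:
        return False  # fewer than `window` episodes played so far
    recent = scores[-window:]
    return sum(recent) / window >= target
```

Calling `is_solved(scores)` after each episode returns False until 100 episodes have been recorded, regardless of how high the early scores are.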

Installing the environment

Download the environment:
- Linux: download
- Mac OSX: download
- Windows (32-bit): download
- Windows (64-bit): download
Unzip the archive into the p2_continuous-control folder.

In the code, import UnityEnvironment as follows (file_name should point to your own copy of Reacher.exe):

from unityagents import UnityEnvironment
# Raw string so the backslashes in the Windows path are not treated as escape sequences
env = UnityEnvironment(file_name=r"C:\Users\AL\Documents\GitHub\deep-reinforcement-learning\p2_continuous-control\Reacher_Windows_x86_64\Reacher.exe", no_graphics=True)
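
Once the environment is loaded, a quick way to check the installation is to play one episode with random actions. The sketch below assumes the `unityagents` interface used in the Udacity continuous-control project (`brain_names`, `reset`, `step`, `rewards`, `local_done`) and that actions lie in [-1, 1]. The helper name `run_random_episode` and the `max_steps` cap are hypothetical and not part of PPO_v0.py.

```python
import numpy as np


def run_random_episode(env, max_steps=1000):
    """Play one episode with random actions and return the mean agent score."""
    brain_name = env.brain_names[0]           # the default (first) brain
    brain = env.brains[brain_name]
    action_size = brain.vector_action_space_size

    env_info = env.reset(train_mode=True)[brain_name]
    num_agents = len(env_info.agents)
    scores = np.zeros(num_agents)

    for _ in range(max_steps):
        # Sample Gaussian actions and clip them to the assumed range [-1, 1]
        actions = np.clip(np.random.randn(num_agents, action_size), -1.0, 1.0)
        env_info = env.step(actions)[brain_name]
        scores += np.array(env_info.rewards)
        if np.any(env_info.local_done):       # episode finished
            break

    return float(np.mean(scores))
```

With the `env` object created above, `print(run_random_episode(env))` should print a score close to zero, since random actions rarely keep the arm on target. Call `env.close()` when finished.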

Instructions

Run PPO_v0.py to train the agent. Training stops after 2000 episodes or as soon as the environment is solved; the code then plots the scores and the average score over the last 100 episodes. It saves the neural network weights in network.pth and the scores in scores.txt. The code writes the current episode, the average score over the last 100 episodes, the maximum score, and the current standard deviation. The agent should be able to solve the environment in approximately 900 episodes.
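
The stopping rule described above (at most 2000 episodes, or earlier once the 100-episode average reaches 30) can be sketched as the loop below. This is an illustration under stated assumptions, not the loop in PPO_v0.py: `train` and `run_episode` are hypothetical names, and `run_episode` stands in for whatever collects one episode and performs the PPO update.

```python
from collections import deque


def train(run_episode, max_episodes=2000, target=30.0, window=100):
    """Run episodes until solved or until `max_episodes` is reached.

    `run_episode` is a callable that plays one episode (including any
    learning step) and returns its score. Returns the list of all scores.
    """
    scores = []
    recent = deque(maxlen=window)  # keeps only the last `window` scores

    for episode in range(1, max_episodes + 1):
        score = run_episode()
        scores.append(score)
        recent.append(score)

        average = sum(recent) / len(recent)
        # Solved only once a full window of episodes has been played
        if len(recent) == window and average >= target:
            break

    return scores
```

The `deque` with `maxlen` discards the oldest score automatically, so the running average always covers at most the last 100 episodes.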
