A mini-project to build a deep RL agent that plays Pong in the OpenAI Gym. This was made at the CCNSS summer school.
Training of the RL agent using policy gradients.
Training was done in the OpenAI Gym environment with Python 2.7.
Packages used:
- OpenAI Gym
- Keras (TensorFlow backend)
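For orientation, here is a minimal sketch of the Gym interaction loop. The environment name `Pong-v0` and the random policy are illustrative defaults, not the project's actual training code:

```python
import gym

# Create the Atari Pong environment and play one episode with random actions.
env = gym.make("Pong-v0")
observation = env.reset()   # raw RGB frame, shape (210, 160, 3)
done = False
total_reward = 0.0
while not done:
    action = env.action_space.sample()   # placeholder for the learned policy
    observation, reward, done, info = env.step(action)
    total_reward += reward               # +1/-1 per point, episode ends at 21 points
print("episode reward:", total_reward)
```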
We would like to train an RL agent to win at the game of Pong.
A. Karpathy introduced a policy gradient algorithm for training an RL agent on Pong. The agent learns by processing raw pixel information from each frame of the game. A policy network outputs the probability of each action given the current image, prob(action|image), and the policy is improved by weighting the network's gradient with the discounted rewards collected during play.
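To make this concrete, below is a sketch of the two core pieces of that pipeline, closely following Karpathy's "Pong from Pixels" post: preprocessing each frame into a flat binary vector, and turning per-step rewards into the discounted returns that weight the policy gradient. The crop bounds and 80x80 size are the values from his post; the function names are ours:

```python
import numpy as np

def preprocess(frame):
    """Crop and downsample a 210x160x3 Pong frame into a flat 80x80 binary vector."""
    frame = frame[35:195]        # crop out score bar and bottom border
    frame = frame[::2, ::2, 0]   # downsample by factor 2, keep one color channel
    frame[frame == 144] = 0      # erase background (type 1)
    frame[frame == 109] = 0      # erase background (type 2)
    frame[frame != 0] = 1        # paddles and ball become 1
    return frame.astype(np.float32).ravel()   # 6400-dim input vector

def discount_rewards(rewards, gamma=0.99):
    """Turn per-step rewards into discounted returns, resetting at each point scored."""
    discounted = np.zeros(len(rewards), dtype=np.float32)
    running = 0.0
    for t in reversed(range(len(rewards))):
        if rewards[t] != 0:
            running = 0.0        # Pong-specific: each point ends a sub-game
        running = running * gamma + rewards[t]
        discounted[t] = running
    # Normalizing stabilizes training: steps before a win push their actions
    # up, steps before a loss push them down.
    discounted -= discounted.mean()
    discounted /= (discounted.std() + 1e-8)
    return discounted
```

The normalized returns are then used as per-step weights on the log-probability of the actions actually taken, for instance via the `sample_weight` argument of Keras' `train_on_batch`.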
As an alternative to policy gradients, we also looked at the deep Q-learning (DQN) algorithm.
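For comparison, deep Q-learning fits a network to the Bellman target r + gamma * max_a' Q(s', a') rather than weighting a policy's gradient. A sketch of the target computation for a batch of transitions; the function and argument names are illustrative, not taken from the original code:

```python
import numpy as np

def q_learning_targets(model, states, actions, rewards, next_states, dones, gamma=0.99):
    """Build regression targets for a Keras Q-network on a batch of transitions."""
    q = model.predict(states)             # current Q(s, .) estimates
    q_next = model.predict(next_states)   # Q(s', .) for bootstrapping
    for i in range(len(states)):
        target = rewards[i]
        if not dones[i]:
            target += gamma * np.max(q_next[i])   # no bootstrap past episode end
        q[i, actions[i]] = target         # only the taken action's target changes
    return q                              # then: model.train_on_batch(states, q)
```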
One of our best models (policy gradient, MLP with 1 hidden layer of 200 units, ReLU activation) learnt to beat the built-in AI agent in the OpenAI Pong environment.
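That architecture can be written down in a few lines of Keras. The 6400-dim input assumes the 80x80 preprocessing sketched above, and the single sigmoid output assumes a binary up/down action choice; both are our assumptions, not a dump of the original code:

```python
from keras.models import Sequential
from keras.layers import Dense
from keras.optimizers import RMSprop

# MLP policy: one hidden layer, 200 ReLU units, sigmoid output P(move up | frame).
model = Sequential()
model.add(Dense(200, activation="relu", input_dim=6400))
model.add(Dense(1, activation="sigmoid"))

# Binary crossentropy on the sampled actions, with discounted returns passed as
# sample weights, implements the REINFORCE policy gradient update.
model.compile(optimizer=RMSprop(lr=1e-3), loss="binary_crossentropy")
```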
The policy gradient model was trained for ~5000 episodes and almost reached zero average reward, i.e., near parity with the built-in opponent (Pong episode rewards range from -21 to +21).
We did some preliminary model comparison and observed that shallower (1-hidden-layer) models converged faster, although final performance depends on the number of training episodes. We also saw that policy gradient training was faster and more gradual than deep Q-learning.
Our final note concerns GPU vs. CPU computation time. We found that smaller networks train faster on a CPU.