Environments

Inspired by the Deep Reinforcement Learning Nanodegree course material.

Summary

Switch backends between Tensorflow and PyTorch.
Explore new environment via yaml config files (no coding required).
Supported Algorithms
- Expected Sarsa
- Deep Q Learning
- Dueling Deep Q Learning
- Double Deep Q Learning
- Prioritized Experience Replay (with Importance Sampling)
- CNN variation of Deep Q Network and Dueling Deep Q Network
Supported Environments
- OpenAI Gym
- Unity ML Agents
Pre-trained Checkpoints for few environments.

Installation

1. Setup Python 3

MacOS

brew install python3 swig && \
    brew install opencv3 --with-python && \
    pip3 install --upgrade pip setuptools wheel

Ubuntu

sudo apt-get install swig python3 python3-pip && \
    sudo pip3 install --upgrade pip setuptools wheel && \
    sudo pip3 install virtualenv

2. Setup Virtual Environment

virtualenv --no-site-packages -p python3 .venv && \
    source .venv/bin/activate && \
    pip install -r requirements.txt

Usage

1. Switch to Virtual Environment

source .venv/bin/activate

2. Train an Agent

python3 train.py -c configs/cartpole.yaml

3. Watch an Agent

python3 watch.py -c configs/cartpole.yaml

Switch Backends

By default PyTorch backend is used, if it is available in the runtime. In case you want to switch to:

Tensorflow

export RL_BACKEND='tf'

PyTorch

export RL_BACKEND='torch'

Environments

Banana (Unity)

Environment Details

Type: UnityML
Goal: The agents must learn to collect as many yellow bananas as possible while avoiding blue bananas.
Reward:
- +1 for collecting yellow banana.
- -1 for collecting blue banana
State Space: 37 dimensions which contains the agent's velocity, along with ray-based perception of objects around agent's forward direction.
Action Space:
- 0 - move forward.
- 1 - move backward.
- 2 - turn left.
- 3 - turn right.
Solved when: Agent gets an average score of +13 over 100 consecutive episodes.

Setup

Download the environment from one of the links below to the checked out directory. You need only select the environment that matches your operating system:
- Linux: click here
- MacOS: click here
- Windows (32-bit): click here
- Windows (64-bit): click here
Edit configs/banana.yaml and change filename field according to your environment.
- Linux: Banana_Linux/Banana.x86_64
- MacOS: Banana.app
- Windows (32-bit): Banana_Windows_x86/Banana.exe
- Windows (64-bit): Banana_Windows_x86_64/Banana.exe

Usage

Train

python3 train.py -c configs/banana.yaml

Watch

python3 watch.py -c configs/banana.yaml

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
checkpoints		checkpoints
configs		configs
docker		docker
reports		reports
rl		rl
test		test
.gitignore		.gitignore
README.md		README.md
Report.md		Report.md
requirements.txt		requirements.txt
train.py		train.py
watch.py		watch.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Summary

Installation

1. Setup Python 3

MacOS

Ubuntu

2. Setup Virtual Environment

Usage

1. Switch to Virtual Environment

2. Train an Agent

3. Watch an Agent

Switch Backends

Tensorflow

PyTorch

Environments

Banana (Unity)

Environment Details

Setup

Usage

Train

Watch

Credits

Reinforcement Learning base Implementation

Machine Learning Softwares

About

Releases

Packages

Languages

RitwikSaikia/udacity_drln_playground

Folders and files

Latest commit

History

Repository files navigation

Summary

Installation

1. Setup Python 3

MacOS

Ubuntu

2. Setup Virtual Environment

Usage

1. Switch to Virtual Environment

2. Train an Agent

3. Watch an Agent

Switch Backends

Tensorflow

PyTorch

Environments

Banana (Unity)

Environment Details

Setup

Usage

Train

Watch

Credits

Reinforcement Learning base Implementation

Machine Learning Softwares

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages