Reinforcement Learning

Learn how to sail safely in dangerous waters to your goal by creating a generation based reinforcement learning model using Q-Learning or SARSA.

What makes the waters dangerous?

There are three types of tiles in the deterministic environment:

no waves - every action moves the boat by 0 additional fields
yellow waves - every action moves the boat by 1 additional field
red waves - every action moves the boat by 2 additional fields

When you use the stochastic environment you can choose the probability of the waves.

Configuration

You can configure:

learning algorithm
learning rate (alpha)
discount factor (gamma)
number of generations
exploration and explotation proportion
value of positive/negative rewards
environment type (stochastic or deterministic)

Statistics

The program automatically generates, shows and exports some related statistics.

Heatmaps

Q-Values

A .csv file showing the q-value of every action (columns) for each episode (rows).

Rewards/Episode

Steps/Episode

This program was developed as part of a university project.

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
results		results
.gitignore		.gitignore
README.md		README.md
boat.gif		boat.gif
environment.py		environment.py
goal.gif		goal.gif
main.py		main.py
plotting.py		plotting.py
reinforcement_learning.py		reinforcement_learning.py
windStrong.gif		windStrong.gif
windWeak.gif		windWeak.gif

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement Learning

What makes the waters dangerous?

Configuration

Statistics

Heatmaps

Q-Values

Rewards/Episode

Steps/Episode

About

Releases

Packages

Languages

mzakarian/Reinforcement-Learning-SARSA-Q-Learning

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning

What makes the waters dangerous?

Configuration

Statistics

Heatmaps

Q-Values

Rewards/Episode

Steps/Episode

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages