TheAthleticCoder / RL-on-OpenAI-Gym Public

Notifications You must be signed in to change notification settings
Fork 0
Star 0

We Implement algorithms such as: Monte Carlo(on and off policy), Q-Learning, SARSA, Policy Iteration and Value Iteration on OpenAI Gym environments.

0 stars 0 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
mc_qlearn_sarsa		mc_qlearn_sarsa
policy_iter_value_iter		policy_iter_value_iter
README.md		README.md
mc_qlearn_sarsa.ipynb		mc_qlearn_sarsa.ipynb
policy_iter_value_iter.ipynb		policy_iter_value_iter.ipynb
requirements_doc.pdf		requirements_doc.pdf

Repository files navigation

RL-on-OpenAI-Gym

In this repository we aim to:

1.Use CliffWalking-v0 from OpenAI gym:

Create two agents to find the optimal policy using Policy Iteration and Value Iteration.
Test-run and visualizing learning.

2.Use Taxi-v3 from OpenAI gym:

Prepare and train your agent using i) On-Policy Monte Carlo and ii) Off-Policy Monte-Carlo using Important Sampling.
Prepare and train two more agents using i) Q-Learning and ii) SARSA.

File Structure:

requirements_doc.pdf gives more detailed explanation of the requirements and the scope of this repository.
mc_qlearn_sarsa.ipynb aims to implement
1. On-Policy Monte Carlo
2. Off-Policy Monte Carlo+Importance Sampling
3. Q-Learning
4. SARSA
policy_iter_value_iter.ipynb aims to implement
1. Policy Iteration
2. Value Iteration

About

We Implement algorithms such as: Monte Carlo(on and off policy), Q-Learning, SARSA, Policy Iteration and Value Iteration on OpenAI Gym environments.

reinforcement-learning reinforcement-learning-algorithms reinforcement-learning-agent reinforcement-learning-environments

Report repository

Releases

No releases published

Packages

No packages published

Languages