RL Engine

An RL engine (monte carlo, SARSA, Qlearning included) for solving reinforcement learning problems.

We get back a 4-tuple from openai gyms, which is of the form: (observation, reward, done, info). We will use observations, reward, and done to generate episodes via an off-policy algorithm.

And then, we will feed the episodes to a target policy that will run improvements upon it. Simple idea.

Details about implementation is covered in the blog about Monte Carlo, and the blog about TD methods.

Try running the python files in examples. They are examples on how to use the MC model.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

RL Engine

Files

README.md

Latest commit

History

README.md

File metadata and controls

RL Engine