reinforcement-learning-intro

Study of Sutton and Barto Reinforcement Learning: An Introduction.

Tabular Methods

Bandits!

Performance of bandit algorithms:

Gradient bandits and greedy bandits with optimistic expectations generally perform the best. Gradient bandits take longer to converge but have the potential to reach a higher performance.

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
algorithms_tabular		algorithms_tabular
blackjack		blackjack
games		games
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
RL intro notes.md		RL intro notes.md
__init__.py		__init__.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

reinforcement-learning-intro

Tabular Methods

Bandits!

About

Releases

Packages

Languages

License

Fibration/reinforcement-learning-intro

Folders and files

Latest commit

History

Repository files navigation

reinforcement-learning-intro

Tabular Methods

Bandits!

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages