This is a simple demonstration of how an RL agent can learn to play tic-tac-toe by playing against itself and updating its value table with the temporal-difference (TD) method.
It is inspired by the tic-tac-toe example in the introduction of Reinforcement Learning: An Introduction (second edition) by Sutton and Barto.
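The self-play update described above can be sketched with the standard TD(0) rule, V(s) <- V(s) + alpha * (V(s') - V(s)), where each board position's value is nudged toward the value of the position that followed it. This is a minimal illustration with hypothetical names (ValueTable, tdUpdate, the board-string encoding); it is not the code in main.go.

```go
package main

import "fmt"

// ValueTable maps a board encoding to the agent's estimated value of that
// position. The string encoding here is made up for illustration.
type ValueTable map[string]float64

// tdUpdate moves the value of state s toward the value of its successor next:
// V(s) <- V(s) + alpha * (V(next) - V(s)).
func tdUpdate(v ValueTable, s, next string, alpha float64) {
	v[s] += alpha * (v[next] - v[s])
}

func main() {
	v := ValueTable{
		"X.O|.X.|...": 0.5, // unexplored state: default estimate
		"X.O|.X.|..X": 1.0, // X has won: maximum value
	}
	// After a self-play move from the first state to the winning state,
	// back the win's value up into the preceding state.
	tdUpdate(v, "X.O|.X.|...", "X.O|.X.|..X", 0.1)
	fmt.Printf("%.2f\n", v["X.O|.X.|..."]) // prints 0.55
}
```

Repeated over many self-play episodes, these small backups propagate win/loss information from terminal positions to earlier ones, which is what the training loop relies on.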
Run go run main.go
from the repository root. Set the number of training episodes in main.go to control the strength of the agent; after about 10,000 training episodes, the agent plays optimally.