Tic-tac-toe Bot based on Q-learning

Q-learning is a model-free reinforcement learning technique. Specifically, Q-learning can be used to find an optimal action-selection policy for any given (finite) Markov decision process (MDP).
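
As a rough illustration of the idea, here is a minimal tabular Q-learning sketch for a tic-tac-toe board. It is not taken from main.py: the 9-character string board encoding, the names q_table, legal_actions, choose_action and update, and the ALPHA, GAMMA and EPSILON values are all assumptions made for this example. It also simplifies the two-player setting by treating next_state as whatever board the learner sees on its next turn.

```python
import random
from collections import defaultdict

# Hyperparameters: illustrative values, not taken from the repository.
ALPHA = 0.5    # learning rate
GAMMA = 0.9    # discount factor
EPSILON = 0.1  # probability of picking a random (exploratory) move

# Q-table mapping (state, action) to an estimated value; unseen pairs are 0.0.
q_table = defaultdict(float)


def legal_actions(state):
    """Return indices of empty cells; state is a 9-character string of 'X', 'O' or ' '."""
    return [i for i, cell in enumerate(state) if cell == " "]


def choose_action(state, rng=random):
    """Epsilon-greedy choice among the legal moves (assumes at least one exists)."""
    actions = legal_actions(state)
    if rng.random() < EPSILON:
        return rng.choice(actions)
    return max(actions, key=lambda a: q_table[(state, a)])


def update(state, action, reward, next_state, done):
    """Apply Q(s,a) += ALPHA * (reward + GAMMA * max_a' Q(s',a') - Q(s,a))."""
    next_actions = [] if done else legal_actions(next_state)
    # A finished game (or a full board) has no future value.
    best_next = max((q_table[(next_state, a)] for a in next_actions), default=0.0)
    target = reward + GAMMA * best_next
    q_table[(state, action)] += ALPHA * (target - q_table[(state, action)])
```

A training loop would call choose_action on each of the learner's turns and update once the resulting reward and next board are known, typically with a non-zero reward only when the game ends.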

If you find bugs or have suggestions, feel free to email me at aiba.prenov@gmail.com

Installation

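Python 3 is required.

Clone the repository and change into its directory:

git clone https://github.com/aibaq/tic_tac_q.git
cd tic_tac_q

Run the bot:

python main.py (or python3 main.py, if python does not point to Python 3 on your system)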
