Skip to content

Latest commit

 

History

History
3 lines (2 loc) · 209 Bytes

README.md

File metadata and controls

3 lines (2 loc) · 209 Bytes

Q-Learning

Q-learning is a model-free reinforcement learning technique. Specifically, Q-learning can be used to find an optimal action-selection policy for any given (finite) Markov decision process (MDP).