Reinforcement Learning: An Introduction (Sutton and Barto), Chapter 9: On-policy Prediction with Approximation
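The chapter named above covers value prediction with function approximation. As a rough illustration only, and not necessarily what this project's code does, here is a minimal sketch of a single semi-gradient TD(0) update with a linear value function. The names `value` and `td0_update`, the feature vectors, and the step size are all hypothetical, chosen for this example.

```python
from typing import List, Sequence


def value(w: Sequence[float], x: Sequence[float]) -> float:
    """Linear value estimate: v_hat(s, w) = w . x(s)."""
    return sum(wi * xi for wi, xi in zip(w, x))


def td0_update(
    w: Sequence[float],
    x: Sequence[float],
    reward: float,
    x_next: Sequence[float],
    alpha: float = 0.1,
    gamma: float = 1.0,
    terminal: bool = False,
) -> List[float]:
    """Apply one semi-gradient TD(0) step and return the updated weights."""
    # Bootstrapped target; a terminal next state is given value 0.
    target = reward + (0.0 if terminal else gamma * value(w, x_next))
    td_error = target - value(w, x)
    # For a linear v_hat, the gradient with respect to w is the feature vector x.
    return [wi + alpha * td_error * xi for wi, xi in zip(w, x)]


# Hypothetical two-feature example (one-hot state features).
w = td0_update([0.0, 0.0], x=[1.0, 0.0], reward=1.0, x_next=[0.0, 1.0], alpha=0.5)
print(w)  # [0.5, 0.0]
```

Only the weights of features active in the current state change, and the target is treated as a constant when taking the gradient, which is why the method is called semi-gradient.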
Start training
docker-compose up
Play a game
docker run \
  -it \
  --rm \
  -v "$(pwd)/logs:/tmp/project/logs" \
  luissaybe/connect-4-reinforcement-learning python3 /tmp/project/src/play.py