Núria Casals Lladó (https://github.com/nuriacasals) & Robin de Groot
Lab assignment 1 for the Reinforcement Learning course at KTH. The assignment consists of two maze environments: one with a minotaur that the player must avoid on the way to the exit, and one with a bank robber (the player) and a police officer.