You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is a simple agent that analyzes the states of blackjack in the case of player vs dealer. The goal is to replicate the results that were proposed by sutton and barto in book Reinforcement Learning Example 5.3.
We will be using Montecarlo off policy methods with 0 discounting.