Skip to content

AlphaZero vs AlphaZero or DQN #1032

Answered by lanctot
madzai asked this question in Q&A
Discussion options

You must be logged in to vote

Hi,

We don't have examples of that but it should be as easy as just copying the logic from lines 184-212. I believe the az_checkpoint flag refers to which checkpoint to load.

For DQN you'd have to train it separately (see the examples of how to do that here: https://github.com/deepmind/open_spiel/blob/master/open_spiel/algorithms/dqn_torch/dqn_torch_test.cc) and then query the agent for moves. You can find the logic to run simulations on any game in OpenSpiel in examples/example.cc. So you'd just have to ask the agent which action it wants by giving it a state (usually via a step function).

If it doesn't exist already you can make a simple DQNBot wrapper that uses the Bot API so that you …

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by lanctot
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants