AlphaZero vs AlphaZero or DQN #1032

madzai · 2023-03-10T05:06:35Z

madzai
Mar 10, 2023

Hello,

./build/examples/alpha_zero_torch_game_example.cc

You can set a trained AlphaZero agent to play against another player (mcts/random/human). What's the best way to change it so that it allows both players to be trained AlphaZero agents loaded from different checkpoint paths, or to load a trained DQN agent to play AZ vs DQN?

Thank you.

Answered by lanctot

Mar 17, 2023

Hi,

We don't have examples of that but it should be as easy as just copying the logic from lines 184-212. I believe the az_checkpoint flag refers to which checkpoint to load.

For DQN you'd have to train it separately (see the examples of how to do that here: https://github.com/deepmind/open_spiel/blob/master/open_spiel/algorithms/dqn_torch/dqn_torch_test.cc) and then query the agent for moves. You can find the logic to run simulations on any game in OpenSpiel in examples/example.cc. So you'd just have to ask the agent which action it wants by giving it a state (usually via a step function).

If it doesn't exist already you can make a simple DQNBot wrapper that uses the Bot API so that you …

View full answer

lanctot · 2023-03-17T19:46:37Z

lanctot
Mar 17, 2023
Maintainer

Hi,

We don't have examples of that but it should be as easy as just copying the logic from lines 184-212. I believe the az_checkpoint flag refers to which checkpoint to load.

For DQN you'd have to train it separately (see the examples of how to do that here: https://github.com/deepmind/open_spiel/blob/master/open_spiel/algorithms/dqn_torch/dqn_torch_test.cc) and then query the agent for moves. You can find the logic to run simulations on any game in OpenSpiel in examples/example.cc. So you'd just have to ask the agent which action it wants by giving it a state (usually via a step function).

If it doesn't exist already you can make a simple DQNBot wrapper that uses the Bot API so that you don't have to special-case how you get moves from DQN (if you do that, feel free to contribute it :))

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AlphaZero vs AlphaZero or DQN #1032

{{title}}

Replies: 1 comment

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Select a reply

AlphaZero vs AlphaZero or DQN #1032

madzai Mar 10, 2023

Replies: 1 comment

lanctot Mar 17, 2023 Maintainer

madzai
Mar 10, 2023

lanctot
Mar 17, 2023
Maintainer