UniversalPoker/ACPC fixes: fullgame+abstracted MaxGameLength, fullgame LegalActions #1035

VitamintK · 2023-03-13T22:09:49Z

A pull request with 3 small fixes for UniversalPoker/ACPC

MaxGameLength for fullgame (i.e. unabstracted game):
MaxGameLength previously was written only for abstracted games where the only bet size is pot-size, so it assumed that each successive bet had to double in size. ~~This adds a fix to MaxGameLength for fullgame and FCHPA (where a half-pot bet is legal) where we assume only that each bet has to be greater than the size of the biggest blind.~~ This fixes MaxGameLength for fullgame and FCHPA, as well as simplifying the MaxGameLength logic for all abstractions. The size of InformationStateTensor is based on MaxGameLength, so this fixes InformationStateTensors.
MaxGameLength for abstracted game:
Change length += NumPlayers() to length += NumPlayers() - 1 for a tighter estimate for abstracted games.
LegalActions for fullgame
In unabstracted games, when a hand goes to showdown, LegalActions still returned a set of raises, despite the state being terminal. This is a fix for that.

…e state is terminal (showdown)

lanctot · 2023-03-14T08:46:49Z

Thanks @VitamintK !

The tests failed because the playthrough dors not match. Can you regenerate the playthrough and add it to the PR?

VitamintK · 2023-03-14T17:33:37Z

Done. Also added a test to universal_poker_test.cc and verified that it fails without the changes in this pull request.

jhtschultz · 2023-03-15T13:42:29Z

open_spiel/games/universal_poker.cc

+  } else {
+    while (maxStack > maxBlind) {
+      maxStack /= 2.0;             // You have always to bet the pot size
+      length += NumPlayers() - 1;  // 1 player bets, and n-2 players call


Why is it n-2 players call and not n-1?

Good question. Here's my thought process. Also let me know if you see anything wrong with the reasoning.

The longest game (with a single betting round, for simplicity) consists of:

n-1 players checking,
1 player betting, n-2 players calling,
1 player raising, n-2 players calling,
etc...,
1 player raising, n-1 players calling

so each min-bet/min-raise is succeeded by n-2 calls (except for the last bet, which is succeeded by n-1). In addition, there are n-1 checks in the beginning. These n-1 checks + the final 1 call add up to n, which is accounted for a few lines above in the code (// Check Actions).

Actually it's probably convenient for future readers of the code if I just put this comment as a code comment. What do ya think?

Thanks for the clarification. This reasoning makes sense, but there's still a few points to resolve.

First, the calculation on line 1101 only accounts for the number of raises, not all of the n-2 calls in between. So I believe that has to be updated.

Second, maybe there's a simpler way to structure this that better handles all betting abstractions. The goal is to figure out what the max number of raises is and then apply your logic regarding max number of check/calls.

max_num_raises = 0 if betting_abstraction == kFC: pass # no raises allowed elif betting_abstraction == kFCPA: pot_size = maxBlind * num_players while pot_size / num_players < maxStack: max_num_raises += 1 pot_size += pot_size * num_players elif betting_abstraction == kFCHPA: pot_size = maxBlind * num_players while pot_size / num_players < maxStack: max_num_raises += 1 pot_size += pot_size / 2 * num_players elif betting_abstraction == fullgame: max_num_raises = (maxStack+maxBlind-1)/maxBlind length += max_num_raises * (num_players - 1)

I think this is more intuitive than dividing the max stack size in half. I totally might be missing something on either of these points though so let me know what you think. Funny how even these simple things can be tricky!

First, the calculation on line 1101 only accounts for the number of raises, not all of the n-2 calls in between. So I believe that has to be updated.

ah yes, thanks. 😅 good catch!

I think this is more intuitive than dividing the max stack size in half.

Yup, definitely. The logic in your snippet all looks good, thanks!

Funny how even these simple things can be tricky!

On the bright side, this has finally made me learn how to calculate a pot-sized bet.

jhtschultz · 2023-03-15T13:48:42Z

Looks good, just left a comment with one question. And thanks for adding the test!

One other thing, please run it through a linter (see recently added bullet 8 from https://github.com/deepmind/open_spiel/blob/master/docs/developer_guide.md#adding-a-game for links to linters) since there's a few style guide things I think the linter will pick up on. Thanks

VitamintK · 2023-03-15T17:44:44Z

I ran downloaded cpplint.py and ran python cpplint.py open_spiel/games/universal_poker.cc and there was no output. Does that mean it passed the linter?

jhtschultz · 2023-03-15T23:48:58Z

Hmm I'd be surprised if there were no output. Did you just download the file or pip install cpplint? That could be it; my understanding is that you're supposed to run it as a command line tool.

VitamintK · 2023-03-16T00:20:50Z

Ah, I figured out the problem. The github.com/google/styleguide cpplint is mostly unmaintained and requires python 2, and fails silently when ran with python 3 😑: google/styleguide#132

There's a fork (which is what you get from pip install cpplint) that works with python3, so I'll run that.

As per google/styleguide#528 (comment) it's not clear whether the google/styleguide cpplint will be maintained going forward, so you might want to change the link in bullet 8 to the pip install cpplint fork (maybe this link?), to prevent future contributors from getting the silent failure.

jhtschultz · 2023-03-16T02:18:12Z

Ahh thanks for catching that. Yeah good call I just sent out a CL to update the link in the dev guide.

jhtschultz

LGTM

PiperOrigin-RevId: 519690955 Change-Id: I2d72b5fb941260a62ffef03a15289f0de05c63b9

VitamintK added 3 commits March 13, 2023 14:21

add MaxGameLength for fullgame

3e1020a

give a tighter upperbound for MaxGameLength with abstractions

4a44987

acpc, fullgame, fix LegalActions so it does not return raises when th…

3a7bb5c

…e state is terminal (showdown)

lanctot mentioned this pull request Mar 14, 2023

Information Tensor for Universal Poker/ACPC is abstracted even when the game is fullgame #1033

Closed

VitamintK added 3 commits March 14, 2023 08:24

add a test

4aa2431

also add an unabstracted universal poker playthrough

b3aa36f

add a testcase for unabstracted universal poker, and re-run playthrough

ee5e6f5

jhtschultz reviewed Mar 15, 2023

View reviewed changes

acpc MaxGameLength: apply jhtschultz's simpler algorithm

e0f464a

jhtschultz approved these changes Mar 16, 2023

View reviewed changes

lanctot added imported This PR has been imported and awaiting internal review. Please avoid any more local changes, thanks! merged internally The code is now submitted to our internal repo and will be merged in the next github sync. labels Mar 16, 2023

lanctot merged commit 2da02cf into google-deepmind:master Mar 20, 2023

lanctot added a commit that referenced this pull request Mar 31, 2023

Update doc for use of cpplint for Python. Context: #1035 (comment)

7c0fae4

PiperOrigin-RevId: 519690955 Change-Id: I2d72b5fb941260a62ffef03a15289f0de05c63b9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

UniversalPoker/ACPC fixes: fullgame+abstracted MaxGameLength, fullgame LegalActions #1035

UniversalPoker/ACPC fixes: fullgame+abstracted MaxGameLength, fullgame LegalActions #1035

VitamintK commented Mar 13, 2023 •

edited

Loading

lanctot commented Mar 14, 2023

VitamintK commented Mar 14, 2023

jhtschultz Mar 15, 2023

VitamintK Mar 15, 2023

VitamintK Mar 15, 2023

jhtschultz Mar 15, 2023

VitamintK Mar 16, 2023

jhtschultz commented Mar 15, 2023

VitamintK commented Mar 15, 2023

jhtschultz commented Mar 15, 2023

VitamintK commented Mar 16, 2023 •

edited

Loading

jhtschultz commented Mar 16, 2023

jhtschultz left a comment

UniversalPoker/ACPC fixes: fullgame+abstracted MaxGameLength, fullgame LegalActions #1035

UniversalPoker/ACPC fixes: fullgame+abstracted MaxGameLength, fullgame LegalActions #1035

Conversation

VitamintK commented Mar 13, 2023 • edited Loading

lanctot commented Mar 14, 2023

VitamintK commented Mar 14, 2023

jhtschultz Mar 15, 2023

Choose a reason for hiding this comment

VitamintK Mar 15, 2023

Choose a reason for hiding this comment

VitamintK Mar 15, 2023

Choose a reason for hiding this comment

jhtschultz Mar 15, 2023

Choose a reason for hiding this comment

VitamintK Mar 16, 2023

Choose a reason for hiding this comment

jhtschultz commented Mar 15, 2023

VitamintK commented Mar 15, 2023

jhtschultz commented Mar 15, 2023

VitamintK commented Mar 16, 2023 • edited Loading

jhtschultz commented Mar 16, 2023

jhtschultz left a comment

Choose a reason for hiding this comment

VitamintK commented Mar 13, 2023 •

edited

Loading

VitamintK commented Mar 16, 2023 •

edited

Loading