Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Original paper results #1

Open
marcociccone opened this issue Jul 30, 2019 · 2 comments
Open

Original paper results #1

marcociccone opened this issue Jul 30, 2019 · 2 comments

Comments

@marcociccone
Copy link

Hi @thomasj02,
Thank you for your pytorch implementation! I'm spending some time trying to reproduce the results from the original paper and the lua code, but unfortunately with no luck.

Have you been able to reach good exploitability values? The paper says around 0.06 for Leduc Poker and indeed I can reach ~0.08 with the lua code. With your code instead I'm able to get ~0.6 with 800K iterations. What is your experience?

Thanks,
Marco

@thomasj02
Copy link
Owner

I've mostly put this aside, as NFSP has been largely superseded by the more recent Deep CFR, Single Deep CFR, and Pluribus papers.

@marcociccone
Copy link
Author

yeah, I'm well aware of those papers. I need NFSP as baseline for my project, and I'm surprised by the difficulties in reproducing this paper.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants