-
Notifications
You must be signed in to change notification settings - Fork 2.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Search tuning at very long time control. #3937
Conversation
Merged via 84b1940, congrats :-) |
Viz was very clear in the commit message about the deviation from the fishtest rules, the very bad behavior at STC and the cherry picking of the VLTC SPRT. Perhaps the right thing to do was to open an Issue to discuss these speculative tests that show the possibility to get improvement at VLTC. I'm worried that the new master is biasing all the new tests, because respect to the previous master is weak in SPRT at STC and perhaps on par at LTC, so IMO better to talk about a revert. |
This commit was tested also at smp conditions, here are the results, for future reference: SMP-STC (5+0.05 th 8): https://tests.stockfishchess.org/tests/view/620e8d5226f5b17ec885144b SMP-LTC (20+0.2 th 8): https://tests.stockfishchess.org/tests/view/620e8d7a26f5b17ec8851450 SMP-VLTC (60+0.6 th 8): https://tests.stockfishchess.org/tests/view/620e8eb226f5b17ec8851462 |
Other Tests of this commit 60+0.6 th 1 https://tests.stockfishchess.org/tests/view/6210f2e5b1792e8985f86e01 Previous RT: 30+0.3 th 8 https://tests.stockfishchess.org/tests/view/62115f93b1792e8985f87eb3 Previous RT: 180+1.8 th 1 https://tests.stockfishchess.org/tests/view/62120c34b1792e8985f89b1a Previous commit: https://nextchessmove.com/dev-builds/84b1940fcae95bb0a641dda9e85cb96f8c21cd22 Previous commit: |
This patch is a result of tuning done by user @candirufish after 150k games. Since the tuned values were really interesting and touched heuristics that are known for their non-linear scaling I decided to run limited games LTC match, even if the STC test was really bad (which was expected). After seeing the results of the LTC match, I also run a VLTC (very long time control) SPRTtest, which passed. The main difference is in extensions: this patch allows much more singular/double extensions, both in terms of allowing them at lower depths and with lesser margins. Failed STC: https://tests.stockfishchess.org/tests/view/620d66643ec80158c0cd3b46 LLR: -2.94 (-2.94,2.94) <0.00,2.50> Total: 4968 W: 1194 L: 1398 D: 2376 Ptnml(0-2): 47, 633, 1294, 497, 13 Performed well at LTC in a fixed-length match: https://tests.stockfishchess.org/tests/view/620d66823ec80158c0cd3b4a ELO: 3.36 +-1.8 (95%) LOS: 100.0% Total: 30000 W: 7966 L: 7676 D: 14358 Ptnml(0-2): 36, 2936, 8755, 3248, 25 Passed VLTC SPRT test: https://tests.stockfishchess.org/tests/view/620da11a26f5b17ec884f939 LLR: 2.96 (-2.94,2.94) <0.50,3.00> Total: 4400 W: 1326 L: 1127 D: 1947 Ptnml(0-2): 13, 309, 1348, 526, 4 closes official-stockfish#3937 Bench: 6318903
This patch is a result of tuning done by user @candirufish after 150k games. Since the tuned values were really interesting and touched heuristics that are known for their non-linear scaling I decided to run limited games LTC match, even if the STC test was really bad (which was expected). After seeing the results of the LTC match, I also run a VLTC (very long time control) SPRTtest, which passed. The main difference is in extensions: this patch allows much more singular/double extensions, both in terms of allowing them at lower depths and with lesser margins. Failed STC: https://tests.stockfishchess.org/tests/view/620d66643ec80158c0cd3b46 LLR: -2.94 (-2.94,2.94) <0.00,2.50> Total: 4968 W: 1194 L: 1398 D: 2376 Ptnml(0-2): 47, 633, 1294, 497, 13 Performed well at LTC in a fixed-length match: https://tests.stockfishchess.org/tests/view/620d66823ec80158c0cd3b4a ELO: 3.36 +-1.8 (95%) LOS: 100.0% Total: 30000 W: 7966 L: 7676 D: 14358 Ptnml(0-2): 36, 2936, 8755, 3248, 25 Passed VLTC SPRT test: https://tests.stockfishchess.org/tests/view/620da11a26f5b17ec884f939 LLR: 2.96 (-2.94,2.94) <0.50,3.00> Total: 4400 W: 1326 L: 1127 D: 1947 Ptnml(0-2): 13, 309, 1348, 526, 4 closes official-stockfish#3937 Bench: 6318903
This patch is a result of tuning done by @candirufish after 150k games.
Since results were really interesting and touched heuristics that are known for it non-linear scaling I decided to run limited games LTC test even with really bad STC (which I expected to be really bad) and seeing it results I also run VLTC SPRT.
The main difference is in extensions, this patch allows much more singular/double extensions, both in terms of allowing them at lower depths and with lesser margins.
Failed STC:
https://tests.stockfishchess.org/tests/view/620d66643ec80158c0cd3b46
LLR: -2.94 (-2.94,2.94) <0.00,2.50>
Total: 4968 W: 1194 L: 1398 D: 2376
Ptnml(0-2): 47, 633, 1294, 497, 13
Performed well at LTC:
https://tests.stockfishchess.org/tests/view/620d66823ec80158c0cd3b4a
ELO: 3.36 +-1.8 (95%) LOS: 100.0%
Total: 30000 W: 7966 L: 7676 D: 14358
Ptnml(0-2): 36, 2936, 8755, 3248, 25
Passed VLTC SPRT:
https://tests.stockfishchess.org/tests/view/620da11a26f5b17ec884f939
LLR: 2.96 (-2.94,2.94) <0.50,3.00>
Total: 4400 W: 1326 L: 1127 D: 1947
Ptnml(0-2): 13, 309, 1348, 526, 4
bench 6318903