-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove puzzles that share the same solution #12
Comments
For example I find this duplicate puzzle in autocorr created chennis collection (only difference is the so called supplied move "sm"):
|
I would like to distinguish a few criteria here:
More sophisticated duplicates are unrealistic to cover I think, and also are within the lichess puzzles, e.g., the exact same mating pattern on different squares or with different material configurations. For puzzles with identical starting positions, with default generator settings this should not occur, since it filters duplicate FENs already there. In the case of including the supplied move the duplicate filter could potentially be improved by using the resulting FEN for filtering instead of the FEN + move pair. chess-variant-puzzler/generator.py Lines 34 to 35 in a8bf5fe
It might be a small hit on performance because you need to compute another FEN, but code-wise should be easy to change. Identical solution lines from different starting positions are rather difficult to filter, since it is very hard to tell if the pattern is really the same. Also it might not even be a bad idea to have the same pattern in different contexts to generalize the pattern recognition. The only place where I used a very specific filtering of this kind so far was for Manchu, since probably 90% of your checkmate puzzles will just be the banner landing on c7/g7, which is super dull. For other variants I haven't seen anything like this though. With regards to identical final positions, the main pattern occurring in practice is that a forced mate in n happened in the game. It will then report the mate in n, mate in n-1, ..., and mate in 1 all as separate puzzles. This looks very repetitive on a small scale when you look at the ordered list of resulting puzzles, and that is what some people have heavily criticized, but once puzzles are unordered/randomized I think their similarity hardly is a problem. Actually, having the same puzzle on different levels of difficulty IMO can be very useful. So I don't see a strong need for such a filtering. So all in all the only minor improvement I currently see is to fix the duplicate filtering for the scenario of using the supplied move notation. Other than that apart from very specific problems like the Manchu one I do not consider duplicates a big problem so far. |
Telling the truth, I just posted above example just as a curiosity. I completely agree with you. There is no real need to fix the supplied move case. I can delete one of them, if it occurs again. |
To avoid slight variations that lead to the same exact solution as chapters 2-3 of https://lichess.org/study/vxrJlFCV
The text was updated successfully, but these errors were encountered: