Use regexp_property_values gem for accurate property mapping #40
Replace usage of the Unicode ranges pstore (which had not been updated for Unicode 13.0 --> 15.1+!) with the regexp_property_values gem. The gem hooks into the C API to directly load the matching codepoints for named properties. It's something I wanted to do originally with this library, 10+ years ago, but didn't know how back then! 😄
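A minimal sketch of what looking up a property's codepoints could look like. It assumes the `matched_codepoints` reader described in the gem's README. The `codepoints_for` helper is a hypothetical name used only for illustration, not part of this library. The fallback branch is also an assumption, added so the snippet still runs where the gem is not installed.

```ruby
# Hypothetical helper: returns the codepoints matched by a named property.
def codepoints_for(property_name)
  # Assumes the gem is installed and exposes `matched_codepoints`,
  # as described in its README.
  require 'regexp_property_values'
  RegexpPropertyValues[property_name].matched_codepoints
rescue LoadError
  # Fallback without the gem: scan the whole Unicode range with a plain regexp.
  regexp = Regexp.new("\\p{#{property_name}}")
  (0..0x10FFFF).select do |codepoint|
    next false if (0xD800..0xDFFF).cover?(codepoint) # surrogates are not characters
    regexp.match?([codepoint].pack('U'))
  end
end

carian = codepoints_for('Carian')
puts format('0x%X', carian.min) # lowest matching codepoint
```

With the gem available, the lookup avoids scanning every codepoint with a regexp.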
Thanks to @jaynetics for pointing this out, and building the above gem.
This PR solves the long-standing issue: #14
It fixes several related issues:
/\p{Carian}/. This was because the script to generate (by brute force!) any matching characters only searched up to 0xFFFF, but the only matching characters start higher than this, from 0x102A0: "𐊠𐊡𐊢𐊣𐊤𐊥𐊦𐊧𐊨𐊩..."
Age=15.1, or overlooked in the original script, e.g. \p{In Miscellaneous Mathematical Symbols-B}. Not any more, because it's directly calling the Onigmo API.
Examples:
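As a rough illustration of the 0xFFFF cutoff described above, the sketch below brute-forces `/\p{Carian}/` twice using only core Ruby: once capped at 0xFFFF and once across the full Unicode range. The `matching_codepoints` helper is a hypothetical name for illustration, not the original generation script.

```ruby
CARIAN = /\p{Carian}/

# Hypothetical helper: brute-force every codepoint up to max_codepoint.
def matching_codepoints(regexp, max_codepoint)
  (0..max_codepoint).select do |codepoint|
    next false if (0xD800..0xDFFF).cover?(codepoint) # surrogates are not characters
    regexp.match?([codepoint].pack('U'))
  end
end

bmp_only   = matching_codepoints(CARIAN, 0xFFFF)   # the old search limit
full_range = matching_codepoints(CARIAN, 0x10FFFF) # every Unicode codepoint

puts bmp_only.empty?                   # true: nothing matches below 0x10000
puts format('0x%X', full_range.first)  # 0x102A0
puts full_range.first(10).pack('U*')   # the first ten Carian characters
```

Scanning the full range this way works, but it means running the regexp against over a million codepoints per property.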