You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In one. it appears that we do find a breakpoint between the e and the . -- my initial guess is that the lookahead sees the . and returns a different rule status (since it could be an abbreviation, URL, etc) than ? which is a definite hard stop. @makotokato@aethanyc Can you take a look at what the expected behavior is here?
I was testing an example from https://docs.rs/icu/latest/icu/segmenter/struct.WordSegmenter.html and I spotted a bug.
If the last character is
.
preceded directly by word, the word isn't returned in result.I expect it to return
["one"]
but it returns empty vector. But if.
is replaced with other punctation mark, for example?
it works as expected.I'm new to
icu4x
so I wasn't able to prepare fix for that :/The text was updated successfully, but these errors were encountered: