You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
If you have a word like zurück in your documents, and you have this set of words to remove ['zur']
Then this step will remove zur in the word, converting zurück into ück.
That's happening because the function is using word boundaries (\b) which are known not to work with Unicode.
Expected
the function uses an unicode compatible regexp.
The text was updated successfully, but these errors were encountered:
Observed
If you have a word like
zurück
in your documents, and you have this set of words to remove['zur']
Then this step will remove
zur
in the word, convertingzurück
intoück
.That's happening because the function is using word boundaries (
\b
) which are known not to work with Unicode.Expected
The text was updated successfully, but these errors were encountered: