-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
to_bytes and from_bytes changes the token lemma #636
Comments
FYI, I am using 1.1.2 |
Thanks. I think the bug here is that the serializer tries to get away with not saving the lemmas, because it thinks it can recalculate them given the POS tags. This turns out to be untrue in this case, because the lemma is a special-case. Hmm. |
I see. Maybe as a stop gap for my project, is it possible to know for which words (like cant) can this problem arise? As in if it is a finite knowable set of words, I can just hackishly fix for those. |
Yes, the special-case rules are listed in |
thanks! |
Closing this and making #1045 the master issue. Work in progress for spaCy v2.0! |
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs. |
The text was updated successfully, but these errors were encountered: