Serialization: store_user_data is ignored in DocBin constructor #9190
polm added the feat / doc (Feature: Doc, Span and Token objects) and bug (Bugs and behaviour differing from documentation) labels on Sep 13, 2021
Thanks for the report. I confirmed this is still an issue with 3.1.2. (I also got some separate errors related to the attributes, though I need to look at that more.) We'll take a look at fixing this.
polm added a commit to polm/spaCy that referenced this issue on Sep 16, 2021
svlandeg pushed a commit that referenced this issue on Oct 1, 2021
This should be fixed by #9226. Thanks again for the report!
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.
How to reproduce the behaviour
When I try to create a DocBin out of documents that have Spans in custom attributes, I get an exception, even though I set store_user_data = False. The exception is thrown in the DocBin.add() function, line 113 of the _serialize.py file.
The exception is thrown because srsly.msgpack_dumps(doc.user_data) cannot serialize Spans. By mistake, it is called even when self.store_user_data is set to False.
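To illustrate the pattern described above, here is a minimal standard-library sketch. It is not spaCy's code: BuggyBin, FixedBin and Unserializable are hypothetical names, and json.dumps stands in for srsly.msgpack_dumps. It only shows how serializing before checking the flag raises even when the flag is False, and how guarding the call avoids that.

```python
import json


class Unserializable:
    """Stand-in for a spaCy Span: json cannot encode it (assumption for illustration)."""


class BuggyBin:
    """Hypothetical model of the reported behaviour, not spaCy's actual DocBin."""

    def __init__(self, store_user_data=False):
        self.store_user_data = store_user_data
        self.user_data = []

    def add(self, user_data):
        # Bug: the payload is serialized before the flag is consulted.
        payload = json.dumps(user_data)
        if self.store_user_data:
            self.user_data.append(payload)


class FixedBin(BuggyBin):
    """Same container, but serialization is guarded by the flag."""

    def add(self, user_data):
        # Serialize only when the caller asked to store user data.
        if self.store_user_data:
            self.user_data.append(json.dumps(user_data))


data = {"my_span": Unserializable()}

try:
    BuggyBin(store_user_data=False).add(data)
    buggy_raised = False
except TypeError:
    # json.dumps raises TypeError for objects it cannot encode.
    buggy_raised = True

fixed = FixedBin(store_user_data=False)
fixed.add(data)  # no exception: the payload is never serialized
```

With the guard in place, unserializable values in the user data only matter when store_user_data is actually enabled.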
The following code reproduces the behavior:
The error
Your Environment