-
-
Notifications
You must be signed in to change notification settings - Fork 4.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Sentence boundary detection + semicolon #519
Comments
This problem will be fixed with the next data release. Closing this issue to keep this topic in one place (see #725). |
This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs. |
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
My issue is in respect to the german sentence boarder disambiguation and semicolons.
I have a lot of cases where spacy would falsely break a sentence after ";" and a following SCONJ (subordinate conjunction), i.e. "; dass", "; wenn", "; ob". I am wondering why the SCONJ doesn't determine that we have one sentence. Shouldn't the syntactic parsing take that into account?
The text was updated successfully, but these errors were encountered: