Issue on sentence splitting #68
Version 0.86 released. Your example is now parsed as a single sentence. Here's the parse: The det site

The sentence is parsed correctly up until the final relative clause ("that he had vowed to bring to an end"). This parse error was causing the previous sentence boundary failure, since sentence boundaries are inferred from the syntactic structure. I've now implemented the technique from this paper: http://www.aclweb.org/anthology/J14-2002 (with a novel twist that I've written up and that is under review).

Please try out the new version (being sure to redownload the model), and report any prominent failures you come across.
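To illustrate why a parse error can turn into a sentence boundary error, here is a minimal sketch of inferring boundaries from dependency heads. It is not spaCy's actual implementation: the function name `sentence_spans`, the convention that a root token is its own head, and the shortened token list are all assumptions made for this example.

```python
from typing import List


def sentence_spans(heads: List[int]) -> List[range]:
    """Group tokens into sentences using only the dependency heads.

    heads[i] is the index of token i's head. By assumption (hypothetical
    convention for this sketch), a root token is its own head. A new
    sentence starts wherever the root changes between adjacent tokens.
    """
    def root_of(i: int) -> int:
        # Follow head links upward until we reach a token that heads itself.
        while heads[i] != i:
            i = heads[i]
        return i

    spans = []
    start = 0
    for i in range(1, len(heads) + 1):
        # Close the current span at the end of input or when the root changes.
        if i == len(heads) or root_of(i) != root_of(start):
            spans.append(range(start, i))
            start = i
    return spans


# Shortened, made-up stand-in for the clause in the report.
tokens = ["he", "had", "vowed", "to", "end", "it"]

# Intended parse: "vowed" (index 2) is the only root.
good_heads = [2, 2, 2, 4, 2, 4]

# Faulty parse: "had" (index 1) is wrongly treated as a second root.
bad_heads = [1, 1, 2, 4, 2, 4]

for heads in (good_heads, bad_heads):
    print([[tokens[i] for i in span] for span in sentence_spans(heads)])
```

With the faulty heads, the tokens split right after "had", which mirrors the split in the reported example: a single wrong attachment creates an extra root, and the extra root becomes an extra sentence.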
I ran into an issue when splitting a paragraph into sentences using spaCy:
before:
"The new site amounts to a modest tweak to the existing U.S. approach in Iraq, and illustrates Obama's reluctance to escalate the fight and reintroduce U.S. soldiers into combat that he had vowed to bring to an end."
after:
"The new site amounts to a modest tweak to the existing U.S. approach in Iraq, and illustrates Obama's reluctance to escalate the fight and reintroduce U.S. soldiers into combat that he had"
"vowed to bring to an end."
The sentence should not have been split.