-
-
Notifications
You must be signed in to change notification settings - Fork 205
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Functions for natural language processing #472
Conversation
def _status_message(text, evaluation): | ||
# currently this uses "print" as everything else interferes with the test cases. | ||
# FIXME find a better solution that is clean and works with web based notebooks. | ||
print('# ' + text) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
evaluation.print_out
?
One thing to fix, if a specific
This can be fixed by installing all missing dataset but I think the best solution would be to wrap everything and catch/handle |
Spacy is currently down and you cannot install the language packages. Here's the deal: https://www.spacy.io/blog/announcement They have have an issue ticket on GitHub. Hope they can solve this soon. |
Looks good. Are you done working on this? |
Yes, I'm done. |
Opening a PR for this since I believe this is pretty complete, apart from some final QA passes.
Depends heavily on third party libraries, but thanks to
require
will handle this gracefully if packages are not installed.This whole thing is obviously somewhat experimental, but so is most of this functionality in MMA.
Everything tries to resemble MMA functions. Noteworthy functions:
Experimental
WordSimilaritydoes not exist in MMA. Yet, it's extremely powerful for semantic analysis so I couldn't help including it, though putting it in this
Experimental` context.TextStructure
gives slightly different results than MMA (it's not the same parser after all), and it currently only supports theConstituentString
mode, since everything else relies ofAssocation
s which we do not support (yet).I'm not sure what a good place for the installation notes (regarding
pattern
andomw
) in the top of the file would look like. Maybe somewhere in the docs?