Relation Extractor custom entities #359
Comments
There has been a lot of interest in making custom relation extraction training available, but I think the path forward is to make it easier to train models for new relations that work with the KBPAnnotator. I'm going to try to make sure that clear documentation and any necessary code changes for that are in place for Stanford CoreNLP 3.8.0.
OK, cool, thanks for the prompt response. So my understanding is that the training input format will be migrated from the Roth CONLL04 format to the KBP format, and at that point this will become flexible. For now I have a small local customisation to adjust this, and I will continue to use that approach.
Yes, that would be the plan. I'm going to start working on this and hopefully it won't take too long. By the way, if you happen to have any sample training data I could look at, I'm looking for an example so I can make sure my modifications work properly.
Hi @J38 - sorry it took a while for me to respond to you.
Please let me know your thoughts! 👍
Hello, thanks for the great Stanford tools! I badly need to be able to train RE with custom entities for my project. I am not a professional (Java) programmer, though I can compile from source if proper instructions are available, and I do not fully understand how to "change the code" as aoldoni suggested. Is training custom relations with custom entities possible in 3.8? If not, how could I use the approach aoldoni suggested? I have a training corpus available in the original Roth format. Many thanks for your reply! I am attaching a small sample training file.
Hi @rpalenik,
Regarding this question specifically, please note:
Hi @aoldoni, many thanks. I need some more help. I understand I need to:
However, I got numerous compilation errors. Have I done it wrong? Can you please help with the right approach? Thanks.
Here is the output from the compiler...
Hi All,
Thanks for the great software. I would like to ask you the following please.
When training specific relations to be extracted from custom entity types using the Relation Extractor, I noticed that the currently supported entity types are hard-coded in some places, e.g.:
By modifying these two places, one can successfully reuse the Relation Extractor with custom entities if needed, but this requires recompiling the code and some initial troubleshooting to work out what has to change.
Would you be interested in a pull request that refactors these hard-coded methods into something configurable from the properties file? For example, the properties file could specify an "entitiesPath" option pointing to a tab-separated file whose columns hold the normalised and non-normalised values of these entities.
If this option is not provided, the default hard-coded entities would still be used, preserving the current behaviour.
This would make Relation Extractor workflows with custom entities possible without recompiling the code.
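To make the proposal concrete, here is a minimal sketch of how such a lookup might work. It is not existing CoreNLP code: the class `EntityTypeMap`, its method names, and the default entries are all hypothetical, and the column order (raw tag first, normalised tag second) is an assumption. Only the "entitiesPath" property name comes from the proposal above.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Properties;

/** Hypothetical helper (not part of CoreNLP): maps raw entity tags to normalised ones. */
final class EntityTypeMap {

  /** Illustrative placeholder defaults, standing in for whatever is currently hard-coded. */
  static Map<String, String> defaultTypes() {
    Map<String, String> types = new LinkedHashMap<>();
    types.put("Peop", "PERSON");
    types.put("Loc", "LOCATION");
    types.put("Org", "ORGANIZATION");
    return types;
  }

  /** Parses lines of the assumed form: rawTag TAB normalisedTag. Blank lines are skipped. */
  static Map<String, String> parse(BufferedReader reader) {
    Map<String, String> types = new LinkedHashMap<>();
    try {
      String line;
      while ((line = reader.readLine()) != null) {
        if (line.trim().isEmpty()) {
          continue;
        }
        String[] cols = line.split("\t");
        if (cols.length != 2) {
          throw new IllegalArgumentException("Expected 2 tab-separated columns: " + line);
        }
        types.put(cols[0].trim(), cols[1].trim());
      }
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
    return types;
  }

  /** Uses the file named by "entitiesPath" if the property is set, else the defaults. */
  static Map<String, String> load(Properties props) {
    String path = props.getProperty("entitiesPath");
    if (path == null) {
      return defaultTypes(); // option absent: keep current behaviour
    }
    try (BufferedReader reader =
        Files.newBufferedReader(Paths.get(path), StandardCharsets.UTF_8)) {
      return parse(reader);
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }
}
```

With something like this, the reader code would call `EntityTypeMap.load(props)` once and look tags up in the returned map instead of branching on fixed strings, so adding an entity type would only mean adding a line to the file.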
Please advise.
Again, thanks!