-
Notifications
You must be signed in to change notification settings - Fork 856
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to Named Entity Recognize using Data Programming in Snorkel? #838
Comments
Hey,
A label function will only take one context of a candidate, not two. So the label functions that you see in the tutorials
would change to something like this
|
@fsonntag thanks for the response!! Just tacking on for @wenfeixiang1991 :
Hope some of this helps! |
Thank you a lot ! |
Hi,
My purpose is extracting two entities(Industry and Company) in every Chinese raw text(or sentence), and each entity including few Chinese Characters. Modeling strategy is LSTM + CRF, but the train data which tagging every single Character is key! So I want to get train labeled data in Data Programming way by using Candidate Extractor + Label Function which is featured in snorkel.
After I read intro and cdr in tutorial and issue #599 and #810, I have some question about how to NER using snorkel:
Industry = candidate_subclass('Industry', ['industry'])
and
Company = candidate_subclass('Company', ['company'])
, then I just get every sentence labeled Industry/Company or not in different notebook?, and the target is sentence classify? But in NER, don't we need tagging every word in sentence?Relationship = candidate_subclass('Relationship', ['person1', 'person2'], values=['Married', 'Employs', False])
,we can not label candidate multi-classes in viewer? what is the right manual of viewer?
Thank you very much!
The text was updated successfully, but these errors were encountered: