I get the wrong result for what looks like an obvious case. Below is a description of Erika B Hess, who is an artist (link). This entity is not part of the wiki_2019 DB; however, another Erika Hess, the skier (link), is present in the DB, and ED links the mention to her.
As the following code snippet shows, the context surrounding the first mention is very different from the Wikipedia description of the skier; nevertheless, ED links to the skier with high confidence.
import requests

API_URL = "http://0.0.0.0:5555"
text_doc = "Erika Hess is a painter, curator, and the host of the I Like Your Work Podcast. Her paintings and drawings center around gender, motherhood, and the environment. In this episode, Erika and I talk about the emotions in her work, what she’s learned through her experience and why it’s so important to have a community behind you as an artist. Also mentioned in today’s episode: Widening the space as an artist 3:48 Erika’s background and her current work 6:41 How feelings and emotions are intertwined in Erika’s work 12:09 Being a mother and a painter 14:36 The importance of community in art, especially as a woman 20:50 How the I Like Your Work Podcast started and what it’s taught Erika 25:50 Erika’s membership and what encouraged her to start one 37:45 The importance of boundaries as an artist 44:53 If you enjoyed this episode, please rate, review and share it! Connect with Erika: Erika’s work: https://www.erikabhess.com/ Erika’s podcast: www.ilikeyourworkpodcast.com/home Erika’s membership: https://www.ilikeyourworkpodcast.com/yourlisteners"

# The span covers the first mention, "Erika Hess"
ed_result = requests.post(API_URL, json={
    "text": text_doc,
    "spans": [(0, 10)]
}).json()
print(ed_result)
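The hard-coded span `(0, 10)` only works because the mention opens the text. A minimal sketch of computing spans for a mention anywhere in the document follows, assuming spans are `(start, length)` pairs (as the `length` variable in the second snippet suggests). The helper name `mention_spans` is hypothetical and not part of REL.

```python
def mention_spans(text, mention):
    """Return a (start, length) pair for every exact occurrence of `mention` in `text`.

    Assumption: the ED endpoint expects spans as (start, length), not (start, end).
    """
    spans = []
    start = text.find(mention)
    while start != -1:
        spans.append((start, len(mention)))
        # Continue searching after the end of the current match
        start = text.find(mention, start + len(mention))
    return spans


sample = "Erika Hess is a painter. Erika Hess paints."
print(mention_spans(sample, "Erika Hess"))  # [(0, 10), (25, 10)]
```

The returned list could be passed as the `"spans"` value of the request payload in place of the hard-coded tuple.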
From my experimentation, a threshold of 0.35 seems appropriate for accepting an ED result.
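A minimal sketch of applying that threshold, assuming each entry of the response is a list of the form `[start, length, mention, entity, ED confidence, ...]`. That layout is an assumption and should be checked against the actual output of your REL server. The function `accept_links` and the sample rows are hypothetical and do not show real REL output.

```python
ED_THRESHOLD = 0.35  # threshold found by experimentation, as stated above


def accept_links(ed_result, threshold=ED_THRESHOLD):
    """Keep only links whose ED confidence reaches the threshold.

    Assumption: each row starts with
    [start, length, mention, entity, ed_confidence].
    """
    accepted = []
    for row in ed_result:
        start, length, mention, entity, ed_conf = row[:5]
        if float(ed_conf) >= threshold:
            accepted.append((mention, entity, float(ed_conf)))
    return accepted


# Made-up rows for illustration only, not real REL output
sample_rows = [
    [0, 10, "Erika Hess", "Entity_A", 0.9],
    [20, 5, "Erika", "Entity_B", 0.2],
]
print(accept_links(sample_rows))  # [('Erika Hess', 'Entity_A', 0.9)]
```

A threshold like this only helps when mismatched links receive low confidence, which is exactly what fails in the case reported here.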
So I ran an experiment in which I replaced all occurrences of Erika Hess with the names of other famous women (none of whom are painters or artists) to see whether REL links to them, even though there is a complete mismatch between the mention context and the entity's Wikipedia embedding.
import requests

API_URL = "http://0.0.0.0:5555"
text_doc = "Erika Hess is a painter, curator, and the host of the I Like Your Work Podcast. Her paintings and drawings center around gender, motherhood, and the environment. In this episode, Erika and I talk about the emotions in her work, what she’s learned through her experience and why it’s so important to have a community behind you as an artist. Also mentioned in today’s episode: Widening the space as an artist 3:48 Erika’s background and her current work 6:41 How feelings and emotions are intertwined in Erika’s work 12:09 Being a mother and a painter 14:36 The importance of community in art, especially as a woman 20:50 How the I Like Your Work Podcast started and what it’s taught Erika 25:50 Erika’s membership and what encouraged her to start one 37:45 The importance of boundaries as an artist 44:53 If you enjoyed this episode, please rate, review and share it! Connect with Erika: Erika’s work: https://www.erikabhess.com/ Erika’s podcast: www.ilikeyourworkpodcast.com/home Erika’s membership: https://www.ilikeyourworkpodcast.com/yourlisteners"

replacements = [
    {'Erika': 'Kara', 'Hess': 'Swisher'},  # journalist
    {'Erika': 'Nancy', 'Hess': 'Pelosi'},  # politician
    {'Erika': 'Kamala', 'Hess': 'Harris'},  # politician
    {'Erika': 'Michelle', 'Hess': 'Obama'},  # politician
    {'Erika': 'Hillary', 'Hess': 'Clinton'},  # politician
    {'Erika': 'Elizabeth', 'Hess': 'Warren'},  # politician
    {'Erika': 'Alexandria', 'Hess': 'Ocasio-Cortez'},  # politician
    {'Erika': 'Kim', 'Hess': 'Kardashian'},  # celebrity
]

for replacement in replacements:
    # Swap first and last name everywhere in the text
    text_doc_ = text_doc
    for k, v in replacement.items():
        text_doc_ = text_doc_.replace(k, v)

    # The first two words are the mention
    words = text_doc_.split()[:2]
    length = len(words[0]) + 1 + len(words[1])
    span = (0, length)

    ed_result = requests.post(API_URL, json={
        "text": text_doc_,
        "spans": [span]
    }).json()
    print(ed_result)
REL ED seems to rely heavily on full-string match and to give little weight to the match between the mention context and the Wikipedia entity embedding. This is a problem when evaluating ED on out-of-DB mentions that string-match another entity in the DB. Is this primarily because the REL ED system is not trained to handle entities that are not in the DB? I run into this problem a lot: often a single-name mention (such as Barack) gets linked to an entity (such as Barack Obama), even though there's a complete mismatch between the context and the Wikipedia description of the linked entity.
Thanks!