Mesh Proteins as chemicals #200
Comments
I took a look at the first one here: https://id.nlm.nih.gov/mesh/D011972.html (Insulin receptor). According to the mesh code, this MESH ID should not be included as a Chemical. As that URL shows, the Tree values are D12.776 and D08, both of which are excluded in the chemical.py mesh filter. At this point I'm not sure whether the MESH ID is somehow getting into the chemical ID list, or whether we're looking at an old result.
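For reference, a tree-prefix exclusion of the kind described could look roughly like the sketch below. This is not the actual chemical.py code: the function names and the example tree numbers are hypothetical, and only the two excluded prefixes (D12.776 and D08) come from the comment above. It also assumes that a single excluded tree number is enough to reject the whole term.

```python
# Hypothetical sketch of a MeSH tree-number filter; not the project's real code.
# Only these two prefixes are taken from the discussion above.
EXCLUDED_TREE_PREFIXES = ("D12.776", "D08")


def under_prefix(tree_number, prefix):
    """True if tree_number is the prefix itself or sits below it in the tree."""
    return tree_number == prefix or tree_number.startswith(prefix + ".")


def is_mesh_chemical(tree_numbers):
    """Accept a term only if it is in the D branch and under no excluded prefix."""
    in_d_branch = any(t.startswith("D") for t in tree_numbers)
    excluded = any(
        under_prefix(t, p) for t in tree_numbers for p in EXCLUDED_TREE_PREFIXES
    )
    return in_d_branch and not excluded
```

With a filter like this, a term whose tree numbers all fall under D12.776 or D08 is rejected, which is why the MESH ID showing up as a chemical points at some other route into the chemical list.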
OK, what I think is going on is that the MESH terms are correctly being put under Protein, but the UMLS terms are still being typed as ChemicalEntities. The MESH terms are then getting dragged along via a mapping. I think the reason the UMLS terms are not being handled correctly is that our list of UMLS Tree IDs doesn't use excludes. So Insulin Receptor has three listings in MRSTY:
So even though we don't let in Receptor, we do let in Enzyme. Instead, we need to say "if you are a receptor, you don't go here, no matter what your other listings say."
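A minimal sketch of the difference between include-only matching and exclude-first matching, assuming a concept's MRSTY listings have already been collected into a list of semantic type names. The set contents and function names are hypothetical and stand in for the project's real lists.

```python
# Hypothetical include/exclude sets, for illustration only.
CHEMICAL_INCLUDES = {"Enzyme"}
CHEMICAL_EXCLUDES = {"Receptor"}


def is_chemical_include_only(semantic_types):
    """Behavior described above: any included type lets the concept in."""
    return bool(set(semantic_types) & CHEMICAL_INCLUDES)


def is_chemical(semantic_types):
    """Proposed behavior: an excluded type vetoes the concept outright."""
    types = set(semantic_types)
    if types & CHEMICAL_EXCLUDES:
        return False  # e.g. a receptor never goes here, whatever else it is
    return bool(types & CHEMICAL_INCLUDES)
```

For a concept listed as both Receptor and Enzyme, `is_chemical_include_only` returns `True` while `is_chemical` returns `False`, which is the change being asked for.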
It also looks like 1.4.1.2.1.7 is being grabbed by protein. So basically we need to
See NCATSTranslator/Feedback#613 NCATSTranslator/Feedback#614 NCATSTranslator/Feedback#615.
These are all proteins, which under Biolink are biological entities, but we're calling them chemicals. I think this was probably just never cleaned up after protein moved over into the biological entity branch.