-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create API for SuppKG (Dietary Supplements) #55
Comments
Pasted from Slack, my notes after reviewing the output file from:
I think the structure is okay for writing queries with x-bte annotation. Point 0: I wonder if the relation array ever gets a LOT of elements Point 1: Looking at the output file, some umls IDs seem to start with "DC" which seems incorrect. It looks like the "D" should be removed, so the ID starts with "C". Examples: Point 2: Looking at the output file, some IDs don't seem to match their names. Examples:
Point 3: Looking at the output file, some semantic types don't exist or don't seem to match the ID given
|
I think this has to do with the fact that suppKG apparently is using a (very) old version of UMLS. From their preprint: This may mean that we should perform some of the same analyses/filtering as we did for semmeddb, as described in biothings/semmeddb#2. |
(from looking at the materials + method section of the preprint) It sounds like the authors made some pseudo-UMLS IDs from "iDISK terms" that didn't map to an existing CUI....is that right? And that some of these "iDISK terms" were drug supplement ingredients...This makes me wonder about the MRCONSO.RRF file that they mention, which sound like it may have mappings from the original "iDISK terms" to pseudo-UMLS IDs used in their KG... Also, it sounds like they put "phsu" as the semantic type for all drug supplements for their work, even if the original UMLS ID isn't considered a Pharmacological Substance. This makes me think of the plant terms (Point 3 / bullet 2 in my above post). They also mention a networkx file and I wonder if that's useful... |
Hi @colleenXu , from SemRep_DS/docs/SemRep_full_fielded_output.txt:
|
@erikyao deployed the API at https://biothings.ncats.io/suppkg based on the parser written by @mnarayan1 (https://github.com/biothings/SuppKG). @colleenXu can you add creation of the smartAPI annotation to your to-do list please? ("Normal" priority -- no special urgency here...) Let's also leave this ticket open for the moment so we can contemplate enhancements to the parser (for example, to handle retired UMLS IDs, get more current human-readable names and semtypes, etc)... |
Related to NCATS-Tangerine/translator-api-registry#122 by @mnarayan1 |
Closing because the API has been made. The rest of the work and discussion can be moved to biothings/biothings_explorer#706 |
SuppKG contains a variety of edges for Dietary Supplements.
Publication: https://pubmed.ncbi.nlm.nih.gov/35709900/
Preprint: https://arxiv.org/abs/2106.12741
Download link: https://github.com/zhang-informatics/SemRep_DS/tree/main/SuppKG
There are 595222 entries under the
links
. Here is one example record:I believe we want to create a record like this (where the info for
name
can be found in thenodes
section of the json).The text was updated successfully, but these errors were encountered: