-
-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add NCI Thesaurus #10
Conversation
@jsstevenson absolutely, I would love to accept external contributions! I would also like to write a manuscript about the current state of versioning in biomedical database and ontology world, and how bioversions could be useful for the community, so if you're thinking about this stuff too I'd be keen to learn more and see if you'd want to help write that paper |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Everything looks good! I suppose this was a quite simple one :) I added a request to extract the date of the current release as well.
Last thing - do you know if there is a specific page corresponding to each version? For example in BioGRID, there's a way to construct a URL for a given version, which is really nice. If that's possible it would be great, but not required because we all know NCIt is very difficult to figure out.
Unfortunately the NCIt FTP archives follow a folder structure that is a little hard to capture in a single f-string -- they place the current year's releases one level up from prior years (which are all housed in subdirectories for each year), eg
I'd definitely be interested in getting in touch -- one of our group's broader projects focuses on knowledgebase integration in the cancer variant interpretation space (https://cancervariants.org/projects/integration/), so we have a vested interest in things like data provenance and reproducibility. |
I'm going to merge now but if you could send a link to that FTP address I would appreciate it |
👍 |
Howdy! We're interested in potentially making use of this in a few of our projects. If you're receptive to PRs, I have a small handful of other sources that we draw from, in addition to NCIt (and let me know if I'm missing anything here).