-
Notifications
You must be signed in to change notification settings - Fork 4
Project: update dictionary
ShweataNHegde edited this page Jan 25, 2021
·
12 revisions
Software and protocols to update dictionaries.
Shweata N. Hegde
PMR, Ambreen Hamadani
- Dheeraj Kumar
- Pruthivrajan
- Plant Interns
Everybody is welcome to set requirements for their dictionary or for any dictionaries, in general.
- @ShweataNHegde: I've set a requirement list for
organization
dictionary, here. https://github.com/petermr/dictionary/issues/3
- merge entries with same WikidataID
- detect and eliminate scholarly articles, books, etc.
- add language wikipedia pages from wikidataID
- (SH) post-SPARQL filtering, or query refinement
- translate attributes into wikidata properties where possible (crossrefid => _p3153_crossrefid)
- remove unwanted terms (term value or wikidataID)
- Incomplete dictionaries(?)- Activity, extraction method, plant parts [Ask Emanuel] - Missing Wikidata items
- Revise description and extract synonym.
- Some terms have wikidata id of scholarly articles
- Does Wikidata id of terms still exist? (Some items might have either be moved or deleted since the dictionary was created) Write python code to go through the ids and check if they exist
- PMR: Write software to convert SPARQL output into the dictionary.
- volatile_compound
- PROBLEM: Chemicals have commas in them. AltLabel gives synonyms separated by a comma.
- PMR: Query all the chemicals automatically and look them up.
- Find out if the compounds are in CheBI
- found in taxon property - P703
- Genes dictionary - Contact Guilia