Skip to content

Project: update dictionary

ShweataNHegde edited this page Jan 25, 2021 · 12 revisions

Contents

Description

Software and protocols to update dictionaries.

Project Manager

Shweata N. Hegde

Technical Lead

PMR, Ambreen Hamadani

Members

  • Dheeraj Kumar
  • Pruthivrajan
  • Plant Interns

Requirments

Everybody is welcome to set requirements for their dictionary or for any dictionaries, in general.

Meeting Record

Date: 21st Jan. 2021

Requirements:

  • merge entries with same WikidataID
  • detect and eliminate scholarly articles, books, etc.
  • add language wikipedia pages from wikidataID
  • (SH) post-SPARQL filtering, or query refinement
  • translate attributes into wikidata properties where possible (crossrefid => _p3153_crossrefid)
  • remove unwanted terms (term value or wikidataID)

Date: 25th Jan. 2021

Requirements - CEVOpen

  • Incomplete dictionaries(?)- Activity, extraction method, plant parts [Ask Emanuel] - Missing Wikidata items
  • Revise description and extract synonym.
  • Some terms have wikidata id of scholarly articles
  • Does Wikidata id of terms still exist? (Some items might have either be moved or deleted since the dictionary was created) Write python code to go through the ids and check if they exist
  • PMR: Write software to convert SPARQL output into the dictionary.
  • volatile_compound
    • PROBLEM: Chemicals have commas in them. AltLabel gives synonyms separated by a comma.
    • PMR: Query all the chemicals automatically and look them up.
    • Find out if the compounds are in CheBI
    • found in taxon property - P703
  • Genes dictionary - Contact Guilia

Progress

Clone this wiki locally