FIMDA: Finding Mutations in the Digital Age

This file documents the integration of the software component “SNP Extraction Tool for Human Variations” (SETH) into the OpenMinTeD platform.

The focus is on software development and integration.

Milestones

The following top-level milestones do not strictly depend on each other:

  1. UIMA XMI data format serialization for SETH output
  2. REST endpoint
  3. Dockerization

Implementation

  • acquire general knowledge about UIMA XMI:
    • Analysis Engines (AEs) produce Analysis Results (ARs): intro
    • Annotators (e.g. SETH) produce Annotations
    • an AR is represented as CAS (Common Analysis Structure): intro, references
    • a CAS contains the analyzed document, a type system and annotations
  • identify relevant UIMA XMI concepts/components e.g. CAS types:
    • annotation (describes a region of a document) -> MutationMention
    • (entity -> Mutation)
  • implement relevant CAS types (MVP)
  • move to full UIMA application
  • think about logging
  • think about multithreading (see UIMA Multi-threaded Applications)
  • implement REST service (MVP)
  • implement complete MutationAnnotation CAS type
    • identify relevant features
    • identify feature types
    • define mappings to CAS primitive types and/or integrate required SETH types into SethTypeSystem.xml
  • write unit test: produce UIMA JSON from input text (via Spring)
  • create a release (Maven how-to)
    • push to GitHub with github-release-plugin
    • NOTE: authentication via private key does not work! The credentials have to be stored in the Maven settings.xml (passing them as system properties when calling the goal during deploy does not work!)
    • NOTE: to delete a created (GitHub) tag, execute in the git root dir: git tag -d 0.0.1 && git push origin --delete refs/tags/0.0.1 (assuming the remote is named origin)
  • push image to Docker Hub
    • choose a "good" docker.image.prefix; currently it is "dfki". It has to be the Docker Hub username
    • create docker account
    • tag image as latest
    • use the Maven plugin: run mvn release with the credentials stored in the Maven settings.xml; passing them directly via mvn dockerfile:push -Ddockerfile.username=... -Ddockerfile.password=... is still possible
  • license compliance
    • create list of included/used packages
  • write a how-to for integrating the NER service (?)
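
The notes above describe an annotation as something that "describes a region of a document", with MutationMention as the annotation type. The following is a minimal, library-free sketch of that stand-off idea in plain Java. It deliberately does not use the UIMA API: the classes `StandoffDemo` and `ToyMutationAnnotator`, the begin/end field layout, and the regular expression are all hypothetical stand-ins chosen for illustration, and SETH's real matching logic is far more elaborate.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Small demo: annotate a document and print each mention with its offsets. */
class StandoffDemo {
    public static void main(String[] args) {
        String document = "The BRAF V600E mutation and the KRAS G12D mutation were found.";
        for (MutationMention mention : ToyMutationAnnotator.annotate(document)) {
            System.out.println(mention.begin + "-" + mention.end + ": "
                    + mention.coveredText(document));
        }
    }
}

/** Hypothetical stand-in for a CAS annotation: a region [begin, end) of the document text. */
final class MutationMention {
    final int begin; // offset of the first covered character (inclusive)
    final int end;   // offset just past the last covered character (exclusive)

    MutationMention(int begin, int end) {
        this.begin = begin;
        this.end = end;
    }

    /** Returns the text this annotation covers in the given document. */
    String coveredText(String document) {
        return document.substring(begin, end);
    }
}

/** Hypothetical stand-in for an annotator such as SETH; not its actual algorithm. */
final class ToyMutationAnnotator {
    // Assumption: only simple substitutions written like "V600E" are recognized.
    private static final Pattern SUBSTITUTION =
            Pattern.compile("\\b[A-Z][0-9]+[A-Z]\\b");

    /** Scans the document and returns one mention per regex match, in text order. */
    static List<MutationMention> annotate(String document) {
        List<MutationMention> mentions = new ArrayList<>();
        Matcher matcher = SUBSTITUTION.matcher(document);
        while (matcher.find()) {
            mentions.add(new MutationMention(matcher.start(), matcher.end()));
        }
        return mentions;
    }
}
```

The document text is never modified; each mention only stores offsets into it. This is the property that lets a CAS carry the analyzed document and its annotations side by side. In a full UIMA application, the offsets would live in a type derived from the built-in annotation type rather than in a hand-written class like this one.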