Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Modify DGV annotations to more general tab format to support complex SVs #307

Open
4 tasks
zhemingfan opened this issue Feb 9, 2022 · 0 comments
Open
4 tasks
Assignees
Labels
enhancement long read support Support for long read sequence data, e.g. from Oxford Nanopore or PacBio reference files To do with generating the reference input files
Milestone

Comments

@zhemingfan
Copy link
Collaborator

Overview

Currently, the Summary module takes in a tab delimited file (similar to a .bed file) as the input. For future iterations, we want to change it to a MAVIS .tab file format so that more information is included.

  • Change support in summary from tab delimited file to MAVIS file format, rename DGV to "known_sv" as SVs from more databases will be added
  • Add column for support for frequency of which a variant is observed
  • Add unit tests for data that pulls from external and internal Nanpore databases (decide on TSV or VCF file formats to support)
  • Add unique ID support to delineate which dataset a call is from
@zhemingfan zhemingfan added enhancement long read support Support for long read sequence data, e.g. from Oxford Nanopore or PacBio reference files To do with generating the reference input files labels Feb 9, 2022
@zhemingfan zhemingfan added this to the v3.0.0 milestone Feb 9, 2022
@zhemingfan zhemingfan self-assigned this Feb 9, 2022
@creisle creisle changed the title Add Nanopore support in the summary module Modify DGV annotations to more general tab format to support complex SVs Feb 19, 2022
@creisle creisle modified the milestones: v3.0.0, v3.1.0 Feb 22, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement long read support Support for long read sequence data, e.g. from Oxford Nanopore or PacBio reference files To do with generating the reference input files
Projects
None yet
Development

No branches or pull requests

2 participants