Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training Genotype Callers with Neural Networks #171

Open
agitter opened this issue Dec 31, 2016 · 0 comments
Open

Training Genotype Callers with Neural Networks #171

agitter opened this issue Dec 31, 2016 · 0 comments

Comments

@agitter
Copy link
Collaborator

agitter commented Dec 31, 2016

https://doi.org/10.1101/097469

We present an open source software toolkit for training deep learning models to call genotypes in high-throughput sequencing data. The software supports SAM, BAM, CRAM and Goby alignments and the training of models for a variety of experimental assays and analysis protocols. We evaluate this software in the Illumina platinum whole genome datasets and find that a deep learning model trained on 80% of the genome achieves a 0.986% accuracy on variants (genotype concordance) when trained with 10% of the data from a genome. The software is distributed at https://github.com/CampagneLaboratory/variationanalysis. The software makes it possible to train genotype calling models on consumer hardware with CPUs or GPU(s). It will enable individual investigators and small laboratories to train and evaluate their own models and to make open source contributions. We welcome contributions to extend this early prototype or evaluate its performance on other gold standard datasets.

This short paper extends the recent #99 for genotype calling. It is primarily a response to DeepVariant (#159). There are several paragraphs contrasting the two approaches and an emphasis on the open source code of this method.

This will fit in the sequencing sub-section of the Study section.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant