Skip to content

KCCG/wrangling-genomics

 
 

Repository files navigation

Wrangling Genomics

Lesson for quality control and wrangling genomics data. This repository is maintained by John Reeves and has been forked from the original Data Carpentry workshop by Josh Herr, Ming Tang, and Fotis Psomopoulos. This fork has been modified for use with the Garvan HPC (Wolfpack).

Background

Wrangling genomics trains novice learners on a variant calling workflow. Participants will learn how to evaluate sequence quality and what to do if it is not good. We will then cover aligning reads to a genome, and calling variants, as well as discussing different file formats. Results will be visualized. Finally, we will cover how to automate the process by building a shell script.

This lesson is part of the Data Carpentry Genomics Workshop.

Contribution

Code of Conduct

All participants should agree to abide by the Data Carpentry Code of Conduct.

Authors

Wrangling genomics is authored and maintained by the community.

Citation

Please cite as:

Erin Alison Becker, Taylor Reiter, Fotis Psomopoulos, Sheldon John McKay, Jessica Elizabeth Mizzi, Jason Williams, … Winni Kretzschmar. (2019, June). datacarpentry/wrangling-genomics: Data Carpentry: Genomics data wrangling and processing, June 2019 (Version v2019.06.1). Zenodo. http://doi.org/10.5281/zenodo.3260609

Packages

No packages published

Languages

  • Python 82.6%
  • R 7.7%
  • Makefile 6.5%
  • Shell 2.6%
  • Ruby 0.6%