-
Notifications
You must be signed in to change notification settings - Fork 16
Pisces Paper Analysis
This webpage details how to recreate the Pisces results from "Pisces: An Accurate and Versatile Variant Caller for Somatic and Germline Next-Generation Sequencing Data". The instructions are given using Docker, but they can easily be adapted to a local install. Your host machine will need a TB of space.
Download the required data (genome, bam files, and truth data) to the host machine, with the following directory structure. All .gz files should be unzipped.
[YourHostDataDir]/
-
AppResults/
The contents of AppResults folder from Basespace project Pisces_Supplementary_Data_v1.0.1 go here. Includes bams and bedfiles needed.
-
genome/Homo_sapiens/UCSC/hg19/Sequence/WholeGenomeFasta/
The genome.fa and GenomeSize.xml from hg19 build of the human genome go here.
-
platinum/
The platinum genomes truth data for NA12877 and NA12878, downloaded from the following links:
https://s3.eu-central-1.amazonaws.com/platinum-genomes/2017-1.0/hg19/small_variants/ConfidentRegions.bed.gz https://s3.eu-central-1.amazonaws.com/platinum-genomes/2017-1.0/hg19/small_variants/ConfidentRegions.bed.gz.tbi https://s3.eu-central-1.amazonaws.com/platinum-genomes/2017-1.0/hg19/small_variants/NA12877/NA12877.vcf.gz https://s3.eu-central-1.amazonaws.com/platinum-genomes/2017-1.0/hg19/small_variants/NA12877/NA12877.vcf.gz.tbi https://s3.eu-central-1.amazonaws.com/platinum-genomes/2017-1.0/hg19/small_variants/NA12878/NA12878.vcf.gz https://s3.eu-central-1.amazonaws.com/platinum-genomes/2017-1.0/hg19/small_variants/NA12878/NA12878.vcf.gz.tbi
(a) Install Docker, if you do not have it.
(b) Download Hap.py v0.3.10
(c) Replace the Hap.py docker file with the docker file located here: Docker image to rerun Pisces Paper analysis. This new Dockerfile fixes some issues with the standard Hap.py install, and installs Pisces on top of Hap.py. Copy run_analysis.sh into the same folder.
d) build Docker image
docker build -t pisces .
docker run -it --mount type=bind,source=[YourHostDataDir]/,target=/data pisces /bin/bash
Running the docker image will set up the analysis software, and copy over the analysis script run_analysis.sh.
Inside the docker container, run run_analysis.sh script. This script will generate the Pisces vcfs and the Hap.py / Som.py .csv files that give the precision and recall results, by sample.
With some minor path changes to suit your directory structure, this analysis script run_analysis.sh can be run locally on any windows or linux machine.
- Pisces 5.2.10 Design Document
- Pisces 5.2.10 Supported Options
- Scylla 5.2.10 Design Document
- Stitcher 5.2.10 Design Document
- VQR 5.2.10 Design Document
- VennVcf 5.2.10 Design Document
- Gemini 5.2.10 Design Document
- AdaptiveGenotyper 5.2.10 Design Document
- Pisces Tools 5.2.10
- Suggested Pipeline Configuration 5.2.10
- Pisces 5.2.9 Quick Start
- Pisces 5.2.9 Design Document
- Pisces 5.2.9 Supported Options
- Scylla 5.2.9 Design Document
- Stitcher 5.2.9 Design Document
- VQR 5.2.9 Design Document
- VennVcf 5.2.9 Design Document
- Pisces Tools 5.2.9
- Suggested Pipeline Configuration 5.2.9
- Pisces 5.2.7 Quick Start
- Pisces 5.2.7 Design Document
- Pisces 5.2.7 Supported Options
- Scylla 5.2.7 Design Document
- Stitcher 5.2.7 Design Document
- VQR 5.2.7 Design Document
- VennVcf 5.2.7 Design Document
- Pisces Tools 5.2.7
- Suggested Pipeline Configuration 5.2.7
- Pisces 5.2.5 Design Document
- Pisces 5.2.5 Supported Options
- Scylla 5.2.5 Design Document
- Stitcher 5.2.5 Design Document
- VQR 5.2.5 Design Document
- Suggested Pipeline Configuration 5.2.5
- Pisces 5.2.0 Design Document
- Pisces 5.2.0 Supported Options
- Scylla 5.2.0 Design Document
- Stitcher 5.2.0 Design Document
- VQR 5.2.0 Design Document
- Suggested Pipeline Configuration 5.2.0
- Pisces Suite 5.2.0 Known Issues and Limitations