MIRACUM-Pipe incorporates tools for detecting single nucleotide variants (SNVs), insertions and deletions (InDels), loss of heterozygosity (LoH), copy number variations (CNVs) as well as quality statistics including the coverage on WES data. Various functional prediction and annotation databases are integrated to automatically annotate the identi-fied variants. The workflow is designed as a fully automated “one-click” solution from the raw sequencing files to the PDF report containing quality assessments, the identified and annotated variants (SNVs, InDels and LoH), copy number variations as well as a functional enrichment of the SNVs and CNVs respectively.
MIRACUM-Pipe is intended for research use only and not for patient treatment, diagnosis and/or medical records!
If you use this repository please cite:
This repo is intended to be run as docker (see MIRACUM-Pipe-docker). Alternatively, you can pick a Galaxy version.
In some cases Docker or Galaxy might be inapropriate. Therefore, one can install this software on a Debian 10 system.
We offer installation scripts. In order to setup up the main components execute the following scripts.
# install all os requirements
./debian/setup.sh
# install all R-packages
Rscript RScripts/install_packages.R
# install tools
./tools/install.sh
Further one needs to install some components manually. See MIRACUM-Pipe-docker for more information. You can also simply apply the setup.sh file for this repo. Just download it into the root of your clone of this project.
Once you installed the system and setup the required tools and databases, you can simply run the pipeline by setting up patients and run ./miracum_pipe.sh
. For more documentation about this, see the documentation of MIRACUM-Pipe-docker.
For the databases contained in annovar updating or installing new databases is possible following the descriptions on the homepage of annovar. If you install additional databases they have to be added in the yaml configuration file, e.g. conf/custom.yaml
. Be aware that the additionally added databases are not considered in the PDF Report.
All neccessary informations about the included R-scripts can be found with the ?functionName function in R.
For testing MIRACUM-Pipe an example dataset is provided and downloaded via the setup.sh script. It contains the necessary capture region files (V5UTR.bed
, V5UTR_Targets.txt
), the sequence files (germline_R{1/2}.fastq.gz
, tumor_R{1/2}.fastq.gz
) and the patient.yaml
. For the sake of runtime the fastq files were adjusted to contain only chromosome 12.
- Patrick Metzger
- Raphael Scheible
- Niklas Reimer
- Maria Hess
- Martin Boeker
- Geoffroy Andrieux
- Melanie Börries
This work is licensed under GNU Affero General Public License version 3.
We thank
- The Molecular Tumor Board Freiburg Team
- The MIRACUM consortium
- The German Ministry of Education and Research (BMBF) for funding
- The developers from Control-FREEC for the code on CNV significance