GitHub - crickbabs/BABS-MNASeqPE: This pipeline has been superceeded. Please use >>>

Introduction

A Nextflow pipeline for processing paired-end Illumina MNASeq sequencing data.

The pipeline was written by The Bioinformatics & Biostatistics Group at The Francis Crick Institute, London.

Pipeline summary

Raw read QC (FastQC, Fastq Screen)
Adapter trimming (cutadapt)
Alignment (BWA)
Mark duplicates (picard)
Filtering to remove:
- reads that are marked as duplicates (SAMtools)
- reads that arent marked as primary alignments (SAMtools)
- reads that are unmapped (SAMtools)
- reads that map to multiple locations (SAMtools)
- reads containing > 3 mismatches in either read of the pair (BAMTools)
- reads that have a user-defined insert size (BAMTools)
- reads that are soft-clipped (BAMTools)
- reads that map to different chromosomes (Pysam)
- reads that arent in FR orientation (Pysam)
- reads where only one read of the pair fails the above criteria (Pysam)
Merge alignments at replicate-level (picard)
- Re-mark duplicates (picard)
- Remove duplicate reads (optional; SAMtools)
- Create normalised bigWig files scaled to 1 million mapped read pairs (BEDTools, wigToBigWig)
Call nucleosome positions and generate smoothed, normalised coverage wig files that can be used to generate occupancy profile plots between samples across features of interest (DANPOS2)
Create IGV session file containing bigWig tracks for data visualisation (IGV)
Collect and present QC at the raw read and alignment-level (MultiQC)

Documentation

The documentation for the pipeline can be found in the docs/ directory:

Pipeline DAG

Credits

The pipeline was written by the The Bioinformatics & Biostatistics Group at The Francis Crick Institute, London.

The pipeline was developed by Harshil Patel.

The NGI-RNAseq pipeline developed by Phil Ewels was used a template for this pipeline. Many thanks to Phil and the team at SciLifeLab.

License

This project is licensed under the MIT License - see the LICENSE.md file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
bin		bin
conf		conf
docs		docs
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
environment.yaml		environment.yaml
main.nf		main.nf
nextflow.config		nextflow.config
run_pipeline.sh		run_pipeline.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Introduction

Pipeline summary

Documentation

Pipeline DAG

Credits

License

About

Releases

Packages

Languages

License

crickbabs/BABS-MNASeqPE

Folders and files

Latest commit

History

Repository files navigation

Introduction

Pipeline summary

Documentation

Pipeline DAG

Credits

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages