alntools processes next-generation sequencing read alignments into a sparse compressed incidence matrix (aka Equivalence Classes) and stores it in a pre-defined binary format for efficient downstream analyses and storage. It enables us to compare, contrast, or combine the results of different alignment strategies.
- Free software: MIT License
- Documentation: https://churchill-lab.github.io/alntools/
split
divides a large bam file into smaller onesbam2ec
preprocesses a bam file into binary equivalence class (EC) formatbam2emase
preprocesses a bam file into EMASE formatec2emase
converts binary EC file into EMASE formatemase2ec
converts EMASE format into binary EC formatrange
finds effective lengths of target sequences from alignment data
We are planning to add more features for preprocessing different formats of NGS read alignments at the population level and summarizing useful information from them. Stay tuned!