Skip to content

ijerkovic/rna-seq

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

rna-seq has few dependencies, depending on the genome aligner that is being used.
BWA, Bowtie, Bowtie2 or LISA can be used, but they have to be exported to the PATH variable then.

rna-seq is a tool for mapping rna reads against its genome, without knowing where the exon regions in genome initially are.

run 'make' to compile and generate the rna-seq binary

Usage: rna-seq <aligner> <genome.fa> <index_dir> <reads.fq> <result_file>

<aligner>      - 'bowtie', 'bowtie2', 'bwa' or 'lisa'
<genome.fa>    - fasta file containing genome
<index_dir>    - prefix used for aligners for storing temporary files
<reads.fq>     - fastq file with reads to be aligned
<result_file>  - file in sam format containing overall alignments

options:
--cov-limit [coverage-limit-number]    - potential exons with coverage limit lower than specified are discarded
--skip-building-index                  - when running on same genome index again no need to wait for index building
--res-cig [res.cig]                    - file in cig format for validating results
--sliding-window-size/-sws [size]      - size of the granulation for cutting initially unmatched reads into smaller chunks - used for spliced matching
--join-exons-threshold/-jet [size]     - if two exons are too close join them (default: 100)
--extend-exons-threshold/-eet [size]   - how much to extend exons on right and left side for finding reads that were spliced (default:50)
--get-histogram-data                   - creates a hist.in file showing coverage data before spliced alignment
--num-threads [num]                    - number of threads for spliced alignment


/src
contains main cpp code

/scripts
contains various scripts used for testing and data preparation
all the scripts have a --help option for more information
few of them are written in cpp so compilation is necessary

/lisa
contains various lisa related binaries, currently lisa is dependant on divsufsort and gflags that need to be in LD_LIBRARY_PATH.
Lisa is being developed independently of rna-seq, more info can be found here https://github.com/Zuza/dtra 

/bin
contains binaries

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published