-
Notifications
You must be signed in to change notification settings - Fork 166
Home
InferCNV is used to explore tumor single cell RNA-Seq data to identify evidence for large-scale chromosomal copy number variations, such as gains or deletions of entire chromosomes or large segments of chromosomes. This is done by exploring expression intensity of genes across positions of the genome in comparison to a set of reference 'normal' cells. A heatmap is generated illustrating the relative expression intensities across each chromosome, and it often becomes readily apparent as to which regions of the genome are over-abundant or less-abundant as compared to normal cells.
InferCNV is one component of the TrinityCTAT toolkit focused on leveraging the use of RNA-Seq to better understand cancer transcriptomes. To find out more about Trinity CTAT please visit TrinityCTAT.
See Installing inferCNV for installation instructions. You have several choices.
Then move into the inferCNV folder and call the following function:
cd inferCNV/example
Rscript ./run.R
This will run inferCNV on test example data and should produce the figure at the bottom of this page.
- Python (2.X or 3.X)
- R (tested in R version 3.5.1 (2018-07-02) -- "Feather Spray")
- R libraries required include: RColorBrewer, gplots, futile.logger, stats, utils, methods, ape, Matrix, Seurat, binhf, fastcluster
inferCNV requires:
- a raw counts matrix of single-cell RNA-Seq expression
- an annotations file which indicates which cells are tumor vs. normal.
- a gene/chromosome positions file
See the above example to explore the example inputs provided. These required inputs are described in more detail below.
InferCNV is compatible with both smart-seq2 and 10x single cell transcriptome data, and presumably other methods (not tested). The counts matrix can be generated using any conventional single cell transcriptome quantification pipeline, yielding a matrix of genes (rows) vs. cells (columns) containing assigned read counts.
The format might look like so:
MGH54_P16_F12 | MGH54_P12_C10 | MGH54_P11_C11 | MGH54_P15_D06 | MGH54_P16_A03 | ... | |
---|---|---|---|---|---|---|
A2M | 0 | 0 | 0 | 0 | 0 | ... |
A4GALT | 0 | 0 | 0 | 0 | 0 | ... |
AAAS | 0 | 37 | 30 | 21 | 0 | ... |
AACS | 0 | 0 | 0 | 0 | 2 | ... |
AADAT | 0 | 0 | 0 | 0 | 0 | ... |
... | ... | ... | ... | ... | ... | ... |
Please use the following citation:
This methodology was also used in:
The following figure should be produced by the Quick Start instructions. This figure shows scRNA-Seq expression of oligodendroglioma with hallmark chr 1p and 19q deletions.
Now that you've gotten the example to work, use the menu in the upper right to navigate to the more detailed descriptions and instructions for exploring your own data.
- InferCNV Home
- Quick Start
- Installing inferCNV
- Running InferCNV
- Applying Noise Filters
- Predicting CNV via HMM
- Bayesian Mixture Model
- Tumor heterogeneity - define tumor subclusters
- Interpreting the Figure
- Inputs to InferCNV
- Outputs from InferCNV
- More inferCNV example data sets
- Using 10x data
- Interactively navigating data using the Next Generation Heatmap Viewer
- Extracting HMM features
- FAQ and common issues