Skip to content

tanaylab/metacell

Repository files navigation

DOI License: MIT

metacell

NOTE: This is the old (original) package written in R. It has been superseded by the Python metacells package, which includes significant algorithmic improvements, and scales to analyzing millions of cells.

The MetaCell R package facilitates analysis of single cell RNA-seq UMI matrices by computing partitions of a cell similarity graph into small (~20-200 typically) homogeneous groups of cells which are defined as metacells (MCs). The derived MCs are then used for building different representations of the data, allowing matrix or 2D graph visualization forming a basis for analysis of cell types, subtypes, transcriptional gradients, cell-cycle variation, gene modules and their regulatory models and more. More details on the usage of the MetaCell pipeline is available in the package vignettes, and in papers using it.

References:

Method: Baran et al. 2018 (Genome Biology, bioarxiv).

Functions reference and usage vignettes are available in the package homepage.

Examples of applications:

Installation

if (!require("BiocManager")) install.packages('BiocManager') 
BiocManager::install("tanaylab/metacell")

Note: Metacell is implemented in R and C++. In particular it uses the Tanay group tgstat library that utilizes shared memory and distributed computing (as well as some specific optional CPU features). The package is tested on linux and macbooks, and is currently not compatible on Windows. A typical application will require at least 16G RAM. For heavier applications (100K cells) we recommend a dual CPU multi-core workstation with 128GM RAM or more.