Skip to content

mdozmorov/Methylation_notes

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 

Repository files navigation

DNA methylation notes

License: MIT PR's Welcome

Methylation-related tools and genomics data analysis resources. Please, contribute and get in touch! See MDmisc notes for other programming and genomics-related notes.

Table of content

Pipelines

  • RnBeads 2.0 - Full analysis pipeline for Illumina 27K/450K/EPIC arrays, WGBS/RRBS/target bisulfite seq. Figure 1 - all steps, from QC, preprocessing, exploratory analysis, to differential methylation/region analysis. New analyses include sex prediction, genetic purity, CNV analysis, cell type inference, age prediction, and more.

  • MEAL - R/Bioconductor Package to integrate methylation and expression data. It can also perform methylation or expression analysis alone. Wraps other packages

  • meffil - Efficient algorithms for analyzing DNA methylation data. R implementation.

  • GLINT - methylation array data analysis pipeline. Illumina 27K/450K/EPIC. ReFACTor to adjust for tissue heterogeneity, inferring population structure, imputation, association testing, basic visualization. Python implementation.

Bisulfite sequening

  • Benchmarking of whole-genome bisulfite converted sequencing data (WGS methylation). Nine tools, tested on simulated data. BWA-mem and BSMAP perform best, Bismark and others are close, often faster and less memory demanding

  • Meth10X - Whole Genome Bisulfite Sequencing Alignment pipeline. Includes SNP calling, region calling (partial methylation, low, high etc.), basic downstream analyses. Designed for SGE/PBS clusters. Built based on in-house bash/python/perl/R script, Bpipe and common alignment tools.

  • bwa-meth - fast and accurate alignment of BS-Seq reads https://arxiv.org/abs/1401.1129.

  • GemBS - alignment to converted and regular reference genome, calling methylated CpGs.

  • MethylDackel - A (mostly) universal methylation extractor for BS-seq experiments. Will process a coordinate-sorted and indexed BAM or CRAM file containing some form of BS-seq alignments and extract per-base methylation metrics from them.

  • methylpy - processing bisulfite sequencing and NOME-seq data, supports single- and paired-end data, read trimming, QC, duplicate removal, outputs methylation states and/or open chromatin, differentially methylated region calling at single cytosine level, multi-group comparison.

  • MOABS - bisulfite sequencing data processing and differential analysis pipeline. Beta-binomial hierarchical model, priors estimated using an Empirical Bayes approach. Credible methylation differences (CDIFs) better capture the biology of methylation differences. Outperforms BSmooth. Can account for CNVs. C++ implementation

Preprocessing

Differential methylation

Deconvolution

  • deconvR - an R package designed for analyzing deconvolution of the bulk sample(s) using an atlas of reference omic signature profiles and a user-selected model. Works with RNA-seq, array- and whole genome DNA methylation.

  • https://github.com/stephaniehicks/methylCC - R/BioC package to estimate the cell composition of whole blood in DNA methylation samples in microarray or sequencing platforms

CNV

  • conumee - copy-number variation analysis using Illumina DNA methylation arrays

Integrative

Data

  • DNA methylation atlas of the mouse brain, single-nucleus sequencing (snmC-seq2, pipeline, ALLCools: ALL methyl-Cytosine tools). 103,982 nuclei (including 95,815 neurons and 8,167 non-neuronal cells) from 45 regions of the mouse cortex, hippocampus, striatum, pallidum and olfactory areas. 161 cell clusters with distinct spatial locations and projection targets. Integration with scATAC-seq confirms cell types and identifies enhancer-gene interactions. HDF5 data on GEO, and links to individual experiments and data in TSV format. Data on Nemoarchive. Merged methylome viewer. Brain Cell Methylation Viewer to view gene, brain region, cell type.
    Paper Liu, Hanqing, Jingtian Zhou, Wei Tian, Chongyuan Luo, Anna Bartlett, Andrew Aldridge, Jacinta Lucero, et al. “DNA Methylation Atlas of the Mouse Brain at Single-Cell Resolution.” Nature 598, no. 7879 (October 7, 2021): 120–28. https://doi.org/10.1038/s41586-020-03182-8.
  • EWASdb - EWASdb is the first epigenome-wide association database (first online at 2015, and first published on Nucleic Acids Res. 2018 Oct 13) which stores the results of 1319 EWAS studies associated with 302 diseases/phenotypes (p<1e-7). Three types of EWAS results were stored in EWASdb: EWAS for single epi-marker; EWAS for KEGG pathway and EWAS for GO (Gene Ontology) categories. http://www.bioapp.org/ewasdb/

  • EWAS Atlas - A knowledgebase of epigenome-wide association studies. https://bigd.big.ac.cn/ewas/index

  • CoRSIVs - 9926 correlated regions of systemin interindividual variation of DNA methylation. GTEx data. Enriched in subtelomeric regions, transposable elements, depleted in TFBSs. Enriched in Quiescent regions, repressive polycomb marks, depleted in heterochromatin, active promoters and enhancers (bivalent). Likely genetically driven. Supplementary matierial: table S2 - all significant 39,424, S3 - filtered 9,926, S13 - 1659 450K probes overlapping 819 CoRSIVs.

    • Gunasekara, Chathura J., C. Anthony Scott, Eleonora Laritsky, Maria S. Baker, Harry MacKay, Jack D. Duryea, Noah J. Kessler, et al. “A Genomic Atlas of Systemic Interindividual Epigenetic Variation in Humans.” Genome Biology 20, no. 1 (December 2019): 105. https://doi.org/10.1186/s13059-019-1708-1.
  • The 450K array measures the methylation status of 485,512 methylcytosine sites in the human genome at a single nucleotide resolution, representing approximately 1.5% of total genomic CpG sites [22126295, 21593595]. While the assayed CpG sites are concentrated around promoter regions and gene bodies, approximately 25% are located in intergenic regions [22126295].

Misc

  • Methylation inhibits TFBSs, but some factors, like homeodomain, POU, NFAT, prefer binding to methylated DNA. These TFs play a role in embryonic and organismal development. Yin, Yimeng, Ekaterina Morgunova, Arttu Jolma, Eevi Kaasinen, Biswajyoti Sahu, Syed Khund-Sayeed, Pratyush K. Das, et al. “Impact of Cytosine Methylation on DNA Binding Specificities of Human Transcription Factors.” Science (New York, N.Y.) 356, no. 6337 (May 5, 2017). doi:10.1126/science.aaj2239.

About

Notes on DNA methylation analysis

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published