-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathnews.csv
We can make this file beautiful and searchable if this error is corrected: It looks like row 2 should actually have 7 columns, instead of 4 in line 1.
32 lines (32 loc) · 34.1 KB
/
news.csv
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
Publication from the lab in Nature Communications;https://www.nature.com/articles/s41467-024-53485-x;;Transgenerational transmission of post-zygotic mutations suggests symmetric contribution of first two blastomeres to human germline;posted October 23, 2024 by Alexej Abyzov;Little is known about the origin of germ cells in humans. We previously leveraged post-zygotic mutations to reconstruct zygote-rooted cell lineage ancestry trees in a phenotypically normal woman, termed NC0. Here, by sequencing the genome of her children and their father, we analyze the transmission of early pre-gastrulation lineages and corresponding mutations across human generations. We find that the germline in NC0 is polyclonal and is founded by at least two cells likely descending from the two blastomeres arising from the first zygotic cleavage. Analyzes of public data from several multi-children families and from 1934 familial quads confirm this finding in larger cohorts, revealing that known imbalances of up to 90:10 in early lineages allocation in somatic tissues are not reflected in mutation transmission to offspring, establishing a fundamental difference in lineage allocation between the soma and the germline. Analyzes of all the data consistently suggest that the germline has a balanced 50:50 lineage allocation from the first two blastomeres.
Publication from the lab in Bioinformatics;https://academic.oup.com/bioinformatics/article/40/8/btae453/7715874;;Genome-wide analysis and visualization of copy number with CNVpytor in igv.js;posted August 17, 2024 by Alexej Abyzov;Copy number variation (CNV) and alteration (CNA) analysis is a crucial component in many genomic studies and its applications span from basic research to clinic diagnostics and personalized medicine. CNVpytor is a tool featuring a read depth-based caller and combined read depth and B-allele frequency (BAF) based 2D caller to find CNVs and CNAs. The tool stores processed intermediate data and CNV/CNA calls in a compact HDF5 file—pytor file. Here, we describe a new track in igv.js that utilizes pytor and whole genome variant files as input for on-the-fly read depth and BAF visualization, CNV/CNA calling and analysis. Embedding into HTML pages and Jupiter Notebooks enables convenient remote data access and visualization simplifying interpretation and analysis of omics data.
Publication from the lab in Scientific Reports;https://www.nature.com/articles/s41598-024-54302-7;;Characterization of enhancer activity in early human neurodevelopment using Massively Parallel Reporter Assay (MPRA) and forebrain organoids;posted February 23, 2024 by Alexej Abyzov;Regulation of gene expression through enhancers is one of the major processes shaping the structure and function of the human brain during development. High-throughput assays have predicted thousands of enhancers involved in neurodevelopment, and confirming their activity through orthogonal functional assays is crucial. Here, we utilized Massively Parallel Reporter Assays (MPRAs) in stem cells and forebrain organoids to evaluate the activity of ~ 7000 gene-linked enhancers previously identified in human fetal tissues and brain organoids. We used a Gaussian mixture model to evaluate the contribution of background noise in the measured activity signal to confirm the activity of ~ 35% of the tested enhancers, with most showing temporal-specific activity, suggesting their evolving role in neurodevelopment. The temporal specificity was further supported by the correlation of activity with gene expression. Our findings provide a valuable gene regulatory resource to the scientific community.
Publication from the lab in Scientific Data;https://www.nature.com/articles/s41597-023-02645-7;;Genomic data resources of the Brain Somatic Mosaicism Network for neuropsychiatric diseases;posted November 20, 2023 by Alexej Abyzov;Somatic mosaicism is defined as an occurrence of two or more populations of cells having genomic sequences differing at given loci in an individual who is derived from a single zygote. It is a characteristic of multicellular organisms that plays a crucial role in normal development and disease. To study the nature and extent of somatic mosaicism in autism spectrum disorder, bipolar disorder, focal cortical dysplasia, schizophrenia, and Tourette syndrome, a multi-institutional consortium called the Brain Somatic Mosaicism Network (BSMN) was formed through the National Institute of Mental Health (NIMH). In addition to genomic data of affected and neurotypical brains, the BSMN also developed and validated a best practices somatic single nucleotide variant calling workflow through the analysis of reference brain tissue. These resources, which include >400 terabytes of data from 1087 subjects, are now available to the research community via the NIMH Data Archive (NDA) and are described here.
Publication from the lab in Nature Neuroscience;https://www.nature.com/articles/s41593-023-01399-0;;Modeling idiopathic autism in forebrain organoids reveals an imbalance of excitatory cortical neuron subtypes during early neurogenesis;posted August 19, 2023 by Alexej Abyzov;Idiopathic autism spectrum disorder (ASD) is highly heterogeneous, and it remains unclear how convergent biological processes in affected individuals may give rise to symptoms. Here, using cortical organoids and single-cell transcriptomics, we modeled alterations in the forebrain development between boys with idiopathic ASD and their unaffected fathers in 13 families. Transcriptomic changes suggest that ASD pathogenesis in macrocephalic and normocephalic probands involves an opposite disruption of the balance between excitatory neurons of the dorsal cortical plate and other lineages such as early-generated neurons from the putative preplate. The imbalance stemmed from divergent expression of transcription factors driving cell fate during early cortical development. While we did not find genomic variants in probands that explained the observed transcriptomic alterations, a significant overlap between altered transcripts and reported ASD risk genes affected by rare variants suggests a degree of gene convergence between rare forms of ASD and the developmental transcriptome in idiopathic ASD.
Publication from the lab in The CRISPR Journal;https://www.liebertpub.com/doi/10.1089/crispr.2022.0050;;Clonally Selected Lines After CRISPR-Cas Editing Are Not Isogenic;posted June 14, 2023 by Alexej Abyzov;The CRISPR-Cas9 system has enabled researchers to precisely modify/edit the sequence of a genome. A typical editing experiment consists of two steps: (1) editing cultured cells, (2) cell cloning and selection of clones with and without intended edit, presumed to be isogenic. The application of CRISPR-Cas9 system may result in off-target edits, whereas cloning will reveal culture-acquired mutations. We analyzed the extent of the former and the latter by whole genome sequencing in three experiments involving separate genomic loci and conducted by three independent laboratories. In all experiments we hardly found any off-target edits, whereas detecting hundreds to thousands of single nucleotide mutations unique to each clone after relatively short culture of 10-20 passages. Notably, clones also differed in copy number alterations (CNAs) that were several kb to several mb in size and represented the largest source of genomic divergence among clones. We suggest that screening of clones for mutations and CNAs acquired in culture is a necessary step to allow correct interpretation of DNA editing experiments. Furthermore, since culture associated mutations are inevitable, we propose that experiments involving derivation of clonal lines should compare a mix of multiple unedited lines and a mix of multiple edited lines.
We are part of SMaHT network established by NIH Common Fund Program;https://www.nih.gov/news-events/news-releases/nih-launches-140-million-effort-investigate-genetic-variation-normal-human-cells-tissues;;NIH launches $140 million effort to investigate genetic variation in normal human cells and tissues. Common Fund Program will accelerate research on human development, aging, and disease.;posted May 11, 2023 by Alexej Abyzov;
Publication from the lab in Nucleic Acid Research;https://academic.oup.com/nar/advance-article/doi/10.1093/nar/gkad254/7110756;;Efficient reconstruction of cell lineage trees for cell ancestry and cancer;posted April 11, 2023 by Alexej Abyzov;Mosaic mutations can be used to track cell ancestries and reconstruct high-resolution lineage trees during cancer progression and during development, starting from the first cell divisions of the zygote. However, this approach requires sampling and analyzing the genomes of multiple cells, which can be redundant in lineage representation, limiting the scalability of the approach. We describe a strategy for cost- and time-efficient lineage reconstruction using clonal induced pluripotent stem cell lines from human skin fibroblasts. The approach leverages shallow sequencing coverage to assess the clonality of the lines, clusters redundant lines and sums their coverage to accurately discover mutations in the corresponding lineages. Only a fraction of lines needs to be sequenced to high coverage. We demonstrate the effectiveness of this approach for reconstructing lineage trees during development and in hematologic malignancies. We discuss and propose an optimal experimental design for reconstructing lineage trees.
Publication from the lab in Science;https://www.science.org/doi/10.1126/science.abm6222;;Analysis of somatic mutations in 131 human brains reveals aging-associated hypermutability;posted August 1, 2022 by TJ and Suki;We analyzed 131 human brains (44 neurotypical, 19 with Tourette syndrome, 9 with schizophrenia, and 59 with autism) for somatic mutations after whole genome sequencing to a depth of more than 200×. Typically, brains had 20 to 60 detectable single-nucleotide mutations, but ~6% of brains harbored hundreds of somatic mutations. Hypermutability was associated with age and damaging mutations in genes implicated in cancers and, in some brains, reflected in vivo clonal expansions. Somatic duplications, likely arising during development, were found in ~5% of normal and diseased brains, reflecting background mutagenesis. Brains with autism were associated with mutations creating putative transcription factor binding motifs in enhancer-like regions in the developing brain. The top-ranked affected motifs corresponded to MEIS (myeloid ectopic viral integration site) transcription factors, suggesting a potential link between their involvement in gene regulation and autism.
Publication from the lab in PLoS Compututional Biology;https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1009487;;All2: A tool for selecting mosaic mutations from comprehensive multi-cell comparisons;posted April 23, 2022 by Alexej Abyzov;Accurate discovery of somatic mutations in a cell is a challenge that partially lays in immaturity of dedicated analytical approaches. Approaches comparing a cell's genome to a control bulk sample miss common mutations, while approaches to find such mutations from bulk suffer from low sensitivity. We developed a tool, All2, which enables accurate filtering of mutations in a cell without the need for data from bulk(s). It is based on pair-wise comparisons of all cells to each other where every call for base pair substitution and indel is classified as either a germline variant, mosaic mutation, or false positive. As All2 allows for considering dropped-out regions, it is applicable to whole genome and exome analysis of cloned and amplified cells. By applying the approach to a variety of available data, we showed that its application reduces false positives, enables sensitive discovery of high frequency mutations, and is indispensable for conducting high resolution cell lineage tracing.
Publication from the lab in GigaScience;https://academic.oup.com/gigascience/article/10/11/giab074/6431715;;CNVpytor: a tool for copy number variation detection and analysis from read depth and allele imbalance in whole-genome sequencing;posted November 29, 2021 by Alexej Abyzov;Detecting copy number variations (CNVs) and copy number alterations (CNAs) based on whole-genome sequencing data is important for personalized genomics and treatment. CNVnator is one of the most popular tools for CNV/CNA discovery and analysis based on read depth. Herein, we present an extension of CNVnator developed in Python -- CNVpytor. CNVpytor inherits the reimplemented core engine of its predecessor and extends visualization, modularization, performance, and functionality. Additionally, CNVpytor uses B-allele frequency likelihood information from single-nucleotide polymorphisms and small indels data as additional evidence for CNVs/CNAs and as primary information for copy number neutral losses of heterozygosity. CNVpytor is significantly faster than CNVnator -- particularly for parsing alignment files (2-20 times faster) -- and has (20-50 times) smaller intermediate files. CNV calls can be filtered using several criteria, annotated, and merged over multiple samples. Modular architecture allows it to be used in shared and cloud environments such as Google Colab and Jupyter notebook. Data can be exported into JBrowse, while a lightweight plugin version of CNVpytor for JBrowse enables nearly instant and GUI-assisted analysis of CNVs by any user. CNVpytor release and the source code are available on GitHub at https://github.com/abyzovlab/CNVpytor under the MIT license.
We are highlighted in Nature Reviews Genetics;https://www.nature.com/articles/s41576-021-00358-4;;Human cell-lineage imbalances;posted April 1, 2021 by Alexej Abyzov;Various barcoding and labelling strategies have been developed for cell-lineage tracing ...
Publication from the lab in Genome Biology;https://genomebiology.biomedcentral.com/articles/10.1186/s13059-021-02285-3;;Comprehensive identification of somatic nucleotide variants in human brain tissue;posted March 29, 2021 by Alexej Abyzov;Post-zygotic mutations incurred during DNA replication, DNA repair, and other cellular processes lead to somatic mosaicism. Somatic mosaicism is an established cause of various diseases, including cancers. However, detecting mosaic variants in DNA from non-cancerous somatic tissues poses significant challenges, particularly if the variants only are present in a small fraction of cells. Here, the Brain Somatic Mosaicism Network conducts a coordinated, multi-institutional study to examine the ability of existing methods to detect simulated somatic single-nucleotide variants (SNVs) in DNA mixing experiments, generate multiple replicates of whole-genome sequencing data from the dorsolateral prefrontal cortex, other brain regions, dura mater, and dural fibroblasts of a single neurotypical individual, devise strategies to discover somatic SNVs, and apply various approaches to validate somatic SNVs. These efforts lead to the identification of 43 bona fide somatic SNVs that range in variant allele fractions from 0.005 to 0.28. Guided by these results, we devise best practices for calling mosaic SNVs from 250× whole-genome sequencing data in the accessible portion of the human genome that achieve 90% specificity and sensitivity. Finally, we demonstrate that analysis of multiple bulk DNA samples from a single individual allows the reconstruction of early developmental cell lineage trees. This study provides a unified set of best practices to detect somatic SNVs in non-cancerous tissues. The data and methods are freely available to the scientific community and should serve as a guide to assess the contributions of somatic SNVs to neuropsychiatric diseases.
Publication from the lab in Science;https://science.sciencemag.org/content/371/6535/1245;;Early developmental asymmetries in cell lineage trees in living individuals;posted March 18, 2021 by Alexej Abyzov;Mosaic mutations can be used to track cell lineages in humans. We used cell cloning to analyze embryonic cell lineages in two living individuals and a postmortem human specimen. Of 10 reconstructed postzygotic divisions, none resulted in balanced contributions of daughter lineages to tissues. In both living individuals, one of two lineages from the first cleavage was dominant across tissues, with 90% frequency in blood. We propose that the efficiency of DNA repair contributes to lineage imbalance. Allocation of lineages in postmortem brain correlated with anterior-posterior axis, associating lineage history with cell fate choices in embryos. We establish a minimally invasive framework for defining cell lineages in any living individual, which paves the way for studying their relevance in health and disease.
Publication from the lab is featured by Genome Research;https://genome.cshlp.org/content/30/12/1695;;Complex mosaic structural variations in human fetal brains;posted Dec 3, 2020 by Alexej Abyzov;Somatic mosaicism, manifesting as single nucleotide variants (SNVs), mobile element insertions, and structural changes in the DNA, is a common phenomenon in human brain cells, with potential functional consequences. Using a clonal approach, we previously detected 200-400 mosaic SNVs per cell in three human fetal brains (15-21 wk postconception). However, structural variation in the human fetal brain has not yet been investigated. Here, we discover and validate four mosaic structural variants (SVs) in the same brains and resolve their precise breakpoints. The SVs were of kilobase scale and complex, consisting of deletion(s) and rearranged genomic fragments, which sometimes originated from different chromosomes. Sequences at the breakpoints of these rearrangements had microhomologies, suggesting their origin from replication errors. One SV was found in two clones, and we timed its origin to ~14 wk postconception. No large scale mosaic copy number variants (CNVs) were detectable in normal fetal human brains, suggesting that previously reported megabase-scale CNVs in neurons arise at later stages of development. By reanalysis of public single nuclei data from adult brain neurons, we detected an extrachromosomal circular DNA event. Our study reveals the existence of mosaic SVs in the developing human brain, likely arising from cell proliferation during mid-neurogenesis. Although relatively rare compared to SNVs and present in ~10% of neurons, SVs in developing human brain affect a comparable number of bases in the genome (~6200 vs. ~4000 bp), implying that they may have similar functional consequences.
Publication from the lab in BMC Bioinformatics;https://bmcbioinformatics.biomedcentral.com/articles/10.1186/s12859-020-03858-y;;SCELLECTOR: ranking amplification bias in single cells using shallow sequencing;posted Nov 13, 2020 by Alexej Abyzov;The study of mosaic mutation is important since it has been linked to cancer and various disorders. Single cell sequencing has become a powerful tool to study the genome of individual cells for the detection of mosaic mutations. The amount of DNA in a single cell needs to be amplified before sequencing and multiple displacement amplification (MDA) is widely used owing to its low error rate and long fragment length of amplified DNA. However, the phi29 polymerase used in MDA is sensitive to template fragmentation and presence of sites with DNA damage that can lead to biases such as allelic imbalance, uneven coverage and over representation of C to T mutations. It is therefore important to select cells with uniform amplification to decrease false positives and increase sensitivity for mosaic mutation detection. We propose a method, Scellector (single cell selector), which uses haplotype information to detect amplification quality in shallow coverage sequencing data. We tested Scellector on single human neuronal cells, obtained in vitro and amplified by MDA. Qualities were estimated from shallow sequencing with coverage as low as 0.3× per cell and then confirmed using 30× deep coverage sequencing. The high concordance between shallow and high coverage data validated the method. Scellector can potentially be used to rank amplifications obtained from single cell platforms relying on a MDA-like amplification step, such as Chromium Single Cell profiling solution.
We are featured in neuroDEVELOPMENTS;https://mailchi.mp/libd/neurodevelopments-1361622;;Mosaics in mind;posted Oct 27, 2020 by Alexej Abyzov;For this issue of neuroDEVELOPMENTS we focus on the startling reality of the mosaic nature of genomes in the human brain. Since the meeting of Craig Venter, Francis Collins, and Bill Clinton at the White House on June 6, 2000 to announce the first draft of the human genome, the idea that we all carry our own version of the human genetic code is commonplace. It is now clear that this is a simplified view of reality because every cell in our body does not have precisely the same genome ...;
Publication from the lab in Annual Review of Genomics and Human Genetics;https://doi.org/10.1146/annurev-genom-083118-015241;;Cell Lineage Tracing and Cellular Diversity in Humans;posted Aug 12, 2020 by Alexej Abyzov;Tracing cell lineages is fundamental for understanding the rules governing development in multicellular organisms and delineating complex biological processes involving the differentiation of multiple cell types with distinct lineage hierarchies. In humans, experimental lineage tracing is unethical, and one has to rely on natural-mutation markers that are created within cells as they proliferate and age. Recent studies have demonstrated that it is now possible to trace lineages in normal, noncancerous cells with a variety of data types using natural variations in the nuclear and mitochondrial DNA as well as variations in DNA methylation status. It is also apparent that the scientific community is on the verge of being able to make a comprehensive and detailed cell lineage map of human embryonic and fetal development. In this review, we discuss the advantages and disadvantages of different approaches and markers for lineage tracing. We also describe the general conceptual design for how to derive a lineage map for humans.;
Publication from the lab in Bioinformatics;https://doi.org/10.1093/bioinformatics/btaa703;;LongAGE: defining breakpoints of genomic structural variants through optimal and memory efficient alignments of long reads;posted Aug 12, 2020 by Alexej Abyzov;Defining the precise location of structural variations (SVs) at single-nucleotide breakpoint resolution is a challenging problem due to large gaps in alignment. Previously, Alignment with Gap Excision (AGE) enabled us to define breakpoints of SVs at single-nucleotide resolution, however, AGE requires a vast amount of memory when aligning a pair of long sequences. To address this, we developed a memory-efficient implementation - LongAGE - based on the classical Hirschberg algorithm. We demonstrate an application of LongAGE for resolving breakpoints of SVs embedded into segmental duplications on Pacific Biosciences (PacBio) reads that can be longer than 10Kbp. Furthermore, we observed different breakpoints for a deletion and a duplication in the same locus, providing direct evidence that such multi-allelic copy number variants (mCNVs) arise from two or more independent ancestral mutations.;
New lab member!!!;;;;posted Feb 17, 2020 by Alexej Abyzov;Postdoctoral fellow Yifan Wang, Ph.D. joined the lab on February 17, 2020. Welcome!
New lab member!!!;;;;posted Sep 2, 2019 by Alexej Abyzov;Postdoctoral fellow Yeongjun Jang, Ph.D. joined the lab on September 2, 2019. Welcome!
New lab member!!!;;;;posted June 10, 2019 by Alexej Abyzov;Postdoctoral fellow Arijit Panda, Ph.D. joined the lab on June 10, 2019. Welcome!
New lab member!!!;;;;posted Oct 16, 2018 by Alexej Abyzov;Postdoctoral fellow Shobana Sekar, Ph.D. joined the lab on October 8, 2018. Welcome!
New lab member!!!;;;;posted Oct 7, 2018 by Alexej Abyzov;Postdoctoral fellow Milovan Suvakov, Ph.D. joined the lab on September 4, 2018. Welcome!
Publication from the lab in Nucleic Acids Research;https://doi.org/10.1093/nar/gkz103;;Chromatin organization modulates the origin of heritable structural variations in human genome;posted Feb 18, 2019 by Alexej Abyzov;Structural variations (SVs) in the human genome originate from different mechanisms related to DNA repair, replication errors, and retrotransposition. Our analyses of 26 927 SVs from the 1000 Genomes Project revealed differential distributions and consequences of SVs of different origin, e.g. deletions from non-allelic homologous recombination (NAHR) are more prone to disrupt chromatin organization while processed pseudogenes can create accessible chromatin. Spontaneous double stranded breaks (DSBs) are the best predictor of enrichment of NAHR deletions in open chromatin. This evidence, along with strong physical interaction of NAHR breakpoints belonging to the same deletion suggests that majority of NAHR deletions are non-meiotic i.e. originate from errors during homology directed repair (HDR) of spontaneous DSBs. In turn, the origin of the spontaneous DSBs is associated with transcription factor binding in accessible chromatin revealing the vulnerability of functional, open chromatin. The chromatin itself is enriched with repeats, particularly fixed Alu elements that provide the homology required to maintain stability via HDR. Through co-localization of fixed Alus and NAHR deletions in open chromatin we hypothesize that old Alu expansion had a stabilizing role on the human genome.
Publication from the lab in Science;http://doi.org/10.1126/science.aat6720;;Transcriptome and epigenome landscape of human cortical development modeled in organoids;posted Dec 13, 2018 by Alexej Abyzov;Genes implicated in neuropsychiatric disorders are active in human fetal brain, yet difficult to study in a longitudinal fashion. We demonstrate that organoids from human pluripotent cells model cerebral cortical development on the molecular level before 16 weeks postconception. A multiomics analysis revealed differentially active genes and enhancers, with the greatest changes occurring at the transition from stem cells to progenitors. Networks of converging gene and enhancer modules were as sembled into six and four global patterns of expression and activity across time. A pattern with progressive down-regulation was enriched with human-gained enhancers, suggesting their importance in early human brain development. A few convergent gene and enhancer modules were enriched in autism-associated genes and genomic variants in autistic children. The organoid model helps identify functional elements that may drive disease onset.
Publication from the lab in Science;http://doi.org/10.1126/science.aan8690;;Different mutational rates and mechanisms in human cells at pregastrulation and neurogenesis;posted Dec 7, 2017 by Alexej Abyzov;Somatic mosaicism in the human brain may alter function of individual neurons. We analyzed genomes of single cells from the forebrains of three human fetuses (15 to 21 weeks post-conception) using clonal cell populations. We detected 200-400 single nucleotide variations (SNVs) per cell. SNV patterns resembled those found in cancer cell genomes, indicating a role of background mutagenesis in cancer. SNVs with a frequency of >2% in brain were shared with the spleen, revealing a pregastrulation origin. We reconstructed cell lineages for the first five post-zygotic cleavages and calculated a mutation rate of ~1.3 per division per cell. Later in development, during neurogenesis, the mutation spectrum shifted toward oxidative damage and the mutation rate increased. Both neurogenesis and early embryogenesis exhibit drastically more mutagenesis than adulthood.
Publication from the lab in Science;http://science.sciencemag.org/content/356/6336/eaal1641.full;;Intersection of diverse neuronal genomes and neuropsychiatric disease: The Brain Somatic Mosaicism Network;posted Apr 27, 2017 by Alexej Abyzov;Neuropsychiatric disorders have a complex genetic architecture. Human genetic population-based studies have identified numerous heritable sequence and structural genomic variants associated with susceptibility to neuropsychiatric disease. However, these germline variants do not fully account for disease risk. During brain development, progenitor cells undergo billions of cell divisions to generate the ~80 billion neurons in the brain. The failure to accurately repair DNA damage arising during replication, transcription, and cellular metabolism amid this dramatic cellular expansion can lead to somatic mutations. Somatic mutations that alter subsets of neuronal transcriptomes and proteomes can, in turn, affect cell proliferation and survival and lead to neurodevelopmental disorders. The long life span of individual neurons and the direct relationship between neural circuits and behavior suggest that somatic mutations in small populations of neurons can significantly affect individual neurodevelopment. The Brain Somatic Mosaicism Network has been founded to study somatic mosaicism both in neurotypical human brains and in the context of complex neuropsychiatric disorders.
Publication from the lab in Genome Research;http://genome.cshlp.org/content/early/2017/02/24/gr.215517.116.abstract;;One thousand somatic SNVs per skin fibroblast cell set baseline of mosaic mutational load with patterns that suggest proliferative origin;posted Mar 7, 2017 by Alexej Abyzov;Few studies have been conducted to understand post-zygotic accumulation of mutations in cells of the healthy human body. We reprogrammed 32 skin fibroblast cells from families of donors into human induced pluripotent stem cell (hiPSC) lines. The clonal nature of hiPSC lines allows a high-resolution analysis of the genomes of the founder fibroblast cells without being confounded by the artifacts of single cell whole genome amplification. We estimate that on average a fibroblast cell in children has 1,035 mostly benign mosaic SNVs. On average, 235 SNVs could be directly confirmed in the original fibroblast population by ultra-deep sequencing, down to an allele frequency (AF) of 0.1%. More sensitive droplet digital PCR experiments confirmed more SNVs as mosaic with AF as low as 0.01%, suggesting that 1,035 mosaic SNVs per fibroblast cell is the true average. Similar analyses in adults revealed no significant increase in the number of SNVs per cell, suggesting that a major fraction of mosaic SNVs in fibroblasts arises during development. Mosaic SNVs were distributed uniformly across the genome and were enriched in a mutational signature previously observed in cancers and in de novo variants and which, we hypothesize, is a hallmark of normal cell proliferation. Finally, AF distribution of mosaic SNVs had distinct narrow peaks, which could be a characteristic of clonal cell selection, clonal expansion, or both. These findings reveal a large degree of somatic mosaicism in healthy human tissues, link de novo and cancer mutations to somatic mosaicism and couple somatic mosaicism with cell proliferation.
Publication from the lab in Genome Research;http://genome.cshlp.org/content/early/2016/05/23/gr.205484.116.abstract;;Elevated variant density around SVs breakpoints in germline lineage lends support to error prone replication hypothesis;posted May 29, 2016 by Taejeong Bae;Copy number variants (CNVs) are a class of structural variants that may involve complex genomic rearrangements (CGRs) and that are hypothesized to have additional mutations around their breakpoints. Understanding the mechanisms underlying CNV formation is fundamental for understanding the repair and mutation mechanisms in cells, thereby shedding light on evolution, genomic disorders, cancer, and complex human traits. In this study, we employ data from the 1000 Genomes Project, to analyze hundreds of loci harboring heterozygous germline deletions in the subjects NA12878 and NA19240. By utilizing synthetic long-read data (longer than 2 kbp) in combination with high coverage short-read data and, in parallel, by comparing with parental genomes, we interrogated the phasing of these deletions with the flanking tens of thousands of heterozygous SNPs and indels. We found, that the density of SNPs/indels flanking the breakpoints of deletions (in-phase variants) is approximately twice as high as the corresponding density for the variants on the haplotype without deletion (out-of-phase variants). This fold-change was even larger, for the subset of deletions with signatures of replication-based mechanism of formation. The allele frequency (AF) spectrum for deletions is enriched for rare events; and the AF spectrum for in-phase SNPs is shifted towards this deletion spectrum, thus offering evidence consistent with the concomitance of the in-phase SNPs/indels with the deletion events. These findings, therefore, lend support to the hypothesis that the mutational mechanisms underlying CNV formation are error prone. Our results could also be relevant for resolving mutation rate discrepancies in human and to explain kataegis.
New lab member!!!;;;;posted Feb 17, 2016 by Alexej Abyzov;Postdoctoral fellow Tanmoy Roychowdhury, Ph.D. joined the lab on February 8, 2016. Welcome!
Publication from the lab in Nature;http://www.nature.com/nature/journal/v526/n7571/full/nature15394.html;;An integrated map of structural variation in 2,504 human genomes;posted Oct 1, 2015 by Alexej Abyzov;Our collaborative work within the framework of the 1000 Genomes Project on discovery and analysis of genome structural variants has been published in Nature.<br><br>Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association.