This repository contains relevant code/pipelines for work done in our paper "Cocaine addiction-like behaviors are associated with long-term changes in gene regulation, energy metabolism, and GABAergic inhibition within the amygdala" (bioRxiv).
This directory contains code/pipelines for analysis of snRNA-seq data used in this project.
This pipeline runs cellranger-count
on FASTQ files for each sample to align them to an rn6 reference genome and creates Seurat objects for each sample along with QC plots.
This pipeline takes the output of the make_seurat_rna
pipline and subsets the cells based on QC metrics; normalizes the data; and integrates the data from all samples using RPCA. Cells are then clustered and marker genes of clusters are identified. All of this is performed with Seurat. Outputs of this pipeline are used for manual cell type annotation.
This pipeline takes annotated snRNA-seq data and performs differential gene expression analysis between high and low AI rats (as described in our paper) using MAST in Seurat. The config.yml
file can be modified to change the parameters of the test for differential expression.
This directory contains scripts used to run clusterProfiler on the annotated snRNA-seq data to perform cell type-specific GSEA.
This directory contains code/pipelines for analysis of snATAC-seq data used in this project.
This script contains code used to create a custom rn6 reference genome. The reference genome created here was used for aligning both the snATAC-seq data and the multiome data.
This directory contains scripts used to realign snATAC-seq FASTQs to the custom rn6 reference genome using cellranger-atac count
.
This pipeline takes the cellranger-atac count
outputs from the previous step and generates a combined peak matrix which is then used to generate new peak by cell matrices for each sample. These are then loaded into Signac and preprocessed accordingly to generate an integrated dataset across all samples (using Harmony). This pipeline also requires an annotated integrated snRNA-seq Seurat object for label transfer (for annotation of cell types in the snATAC-seq dataset).
This pipeline takes annotated snATAC-seq data as a Signac object and performs cell type-specific peak accessibility analysis between high and low AI rats. It also performs a permutation test and pulls out foreground/background sequences based on significance values, which may be useful for downstream enrichment analyses (e.g. motif finding).
This directory contains scripts used to coembed the snRNA-seq with the snATAC-seq.
This pipeline contains code used for performing a partitioned heritability analysis for enrichment of GWAS risk variants in cell type-specific open chromatin regions.
This pipeline uses MACS to call peaks on the BAM outputs of cellranger-atac count
in a cell type-specific manner. Requires a mapping of cell barcodes to cell type annotations. May be useful for downstream analyses.
This directory contains code used to perform differential analysis of transcription factors. Notebooks include pipelines for running chromVar analysis, as well as data visualization.
Questions? Contact jlz014 [at] eng [dot] ucsd [dot] edu