Spatial-RNA-Analysis

Code Analysis and Visualization README

This README gives an overview of the Python code in the Scanpy.ipynb notebook, which uses the Scanpy library to analyze spatial transcriptomics data and to generate visualizations of gene expression patterns. The data for the spatial analysis is provided with the paper listed under Cited paper.

Introduction

The code reads the data, preprocesses it, calculates quality control metrics, identifies highly variable genes, performs dimensionality reduction, and clusters the cells. It then identifies marker genes for each cluster and generates visualizations for exploration.

Dependencies

To run this code, you'll need the following Python libraries:

Scanpy
pandas
Matplotlib
seaborn
NumPy

Install these dependencies using the following command:

pip install scanpy pandas matplotlib seaborn numpy
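
To confirm that the installation worked before opening the notebook, a small check such as the one below can help. It is only a sketch: the helper name missing_packages is made up for illustration, and the list simply mirrors the dependencies named above.

```python
from importlib.util import find_spec

# Import names of the dependencies listed in this README.
REQUIRED = ["scanpy", "pandas", "matplotlib", "seaborn", "numpy"]


def missing_packages(names):
    """Return the names that cannot be imported in the current environment."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    print("Missing packages:", ", ".join(missing) if missing else "none")
```

Depending on the Scanpy version, Leiden clustering may also need an extra package such as leidenalg, so an import error at that step usually points to a missing optional dependency rather than a problem with the data.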

Code Explanation

The code can be divided into the following sections:

Data Loading and Preprocessing: The code starts by loading spatial transcriptomics data using sc.read_visium. It then performs data preprocessing steps like adding new columns to the data, filtering cells based on counts, and filtering genes.
Quality Control and Visualization: The code calculates quality control metrics using sc.pp.calculate_qc_metrics and generates histograms to visualize the distribution of certain metrics.
Gene Filtering and Analysis: The code further filters genes based on their expression levels and then performs principal component analysis (PCA), neighborhood graph construction, and UMAP embedding for dimensionality reduction and cell clustering. It also uses the Leiden algorithm for community detection.
Marker Gene Identification: The code identifies marker genes for each cluster using the Wilcoxon rank-sum test and selects significant markers based on adjusted p-values and fold changes.
Visualization: The code generates spatial and UMAP-based visualizations of clusters, gene expression, and other relevant information.
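
The sections above can be sketched roughly as follows. This is not the notebook's code: the thresholds in Params, the mitochondrial-gene column, the normalization step, and the names Params, select_markers and run_pipeline are all assumptions chosen for illustration, so compare them with Scanpy.ipynb before relying on them.

```python
from dataclasses import dataclass


@dataclass
class Params:
    """Illustrative thresholds; none of these values come from the notebook."""
    min_counts: int = 500     # minimum total counts per spot
    min_cells: int = 10       # minimum number of spots in which a gene is detected
    n_top_genes: int = 2000   # number of highly variable genes to flag
    max_padj: float = 0.05    # adjusted p-value cutoff for markers
    min_logfc: float = 1.0    # log fold-change cutoff for markers


def select_markers(records, max_padj, min_logfc):
    """Keep rows whose adjusted p-value and log fold change pass both cutoffs."""
    return [
        r for r in records
        if r["pvals_adj"] < max_padj and r["logfoldchanges"] > min_logfc
    ]


def run_pipeline(visium_dir, params=None):
    """Load a Visium run, then apply QC, clustering and marker detection."""
    import scanpy as sc  # imported lazily so the helpers above work without Scanpy

    params = params or Params()

    # Data loading and preprocessing
    adata = sc.read_visium(visium_dir)
    adata.var_names_make_unique()
    # Example of adding a new column: flag mitochondrial genes (assumed step)
    adata.var["mt"] = adata.var_names.str.startswith("MT-")

    # Quality control metrics, then filtering of spots and genes
    sc.pp.calculate_qc_metrics(adata, qc_vars=["mt"], inplace=True)
    sc.pp.filter_cells(adata, min_counts=params.min_counts)
    sc.pp.filter_genes(adata, min_cells=params.min_cells)

    # Normalization (assumed) and highly variable genes
    sc.pp.normalize_total(adata)
    sc.pp.log1p(adata)
    sc.pp.highly_variable_genes(adata, flavor="seurat", n_top_genes=params.n_top_genes)

    # PCA, neighborhood graph, UMAP embedding and Leiden clustering
    sc.pp.pca(adata)
    sc.pp.neighbors(adata)
    sc.tl.umap(adata)
    sc.tl.leiden(adata, key_added="clusters")

    # Marker genes per cluster with the Wilcoxon rank-sum test
    sc.tl.rank_genes_groups(adata, "clusters", method="wilcoxon")
    table = sc.get.rank_genes_groups_df(adata, group=None)
    markers = select_markers(table.to_dict("records"), params.max_padj, params.min_logfc)

    # UMAP and spatial views of the clusters
    sc.pl.umap(adata, color="clusters")
    sc.pl.spatial(adata, img_key="hires", color="clusters")
    return adata, markers
```

The marker filter is kept as a plain function on records so the cutoffs can be tried on a few rows without loading any data. sc.get.rank_genes_groups_df also accepts pval_cutoff and log2fc_min arguments, which may be a simpler route if no custom logic is needed.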

Usage

Install the required dependencies.
Copy and paste the code into a Python script or Jupyter Notebook.
Modify file paths and parameters as needed.
Run the code step by step to observe the data analysis and visualization process.
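
When modifying file paths, it can help to check the input folder first. The sketch below assumes the default layout that sc.read_visium looks for, namely a filtered_feature_bc_matrix.h5 count file and a spatial folder. The function name check_visium_dir and the example path are placeholders, not part of the notebook.

```python
from pathlib import Path

# Default names expected by sc.read_visium inside a Visium output folder.
COUNT_FILE = "filtered_feature_bc_matrix.h5"
SPATIAL_DIR = "spatial"


def check_visium_dir(path):
    """Return the expected entries that are missing from a Visium output folder."""
    root = Path(path)
    problems = []
    if not (root / COUNT_FILE).is_file():
        problems.append(COUNT_FILE)
    if not (root / SPATIAL_DIR).is_dir():
        problems.append(SPATIAL_DIR)
    return problems


if __name__ == "__main__":
    # "path/to/visium_sample" is a placeholder; point it at your own data.
    missing = check_visium_dir("path/to/visium_sample")
    print("Missing:", missing if missing else "nothing, folder looks complete")
```

If the count file has a different name in your dataset, sc.read_visium accepts a count_file argument, and the constant above can be changed to match.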

Results

The code provides insights into spatial transcriptomics data through various visualizations. It identifies clusters of cells, visualizes gene expression patterns, and highlights marker genes associated with each cluster. These results help in understanding the spatial distribution of gene expression in the tissue under study.

Please note that the provided code snippets might need adjustments according to your specific dataset and requirements. Make sure to customize file paths, parameters, and visualization options accordingly.

Cited paper

Kadur Lakshminarasimha Murthy, P., Sontake, V., Tata, A. et al. Human distal lung maps and lineage hierarchies reveal a bipotent progenitor. Nature 604, 111–119 (2022). https://doi.org/10.1038/s41586-022-04541-3

Repository: karan-nanda/Spatial-RNA-Analysis