GitHub - JonnyTran/OpenOmics: A bioinformatics API to interface with public multi-omics bio databases for wicked fast data integration.

OpenOmics is currently under active development and we may break API compatibility in the future.

This Python package provide a series of tools to integrate and explore the genomics, transcriptomics, proteomics, and clinical data (aka multi-omics data). With interfaces to popular annotation databases and scalable data-frame manipulation tools, OpenOmics facilitates the common data wrangling tasks when preparing data for RNA-seq bioinformatics analysis.

Documentation (Latest | Stable) | OpenOmics at a glance

Features

OpenOmics assist in integration of heterogeneous multi-omics bioinformatics data. The library provides a Python API as well as an interactive Dash web interface. It features support for:

Genomics, Transcriptomics, Proteomics, and Clinical data.
Harmonization with 20+ popular annotation, interaction, disease-association databases.

OpenOmics also has an efficient data pipeline that bridges the popular data manipulation Pandas library and Dask distributed processing to address the following use cases:

Providing a standard pipeline for dataset indexing, table joining and querying, which are transparent and customizable for end-users.
Providing Efficient disk storage for large multi-omics dataset with Parquet data structures.
Integrating various data types including interactions and sequence data, then exporting to NetworkX graphs or data generators for down-stream machine learning.
Accessible by both developers and scientists with a Python API that works seamlessly with an external Galaxy tool interface or the built-in Dash web interface (WIP).

Installation:

PyPI

pip install openomics

Conda

conda install openomics -c jonnytran # Work in progress

From source

git clone https://github.com/JonnyTran/OpenOmics/
cd OpenOmics
pip install -e .

Citations

The journal paper for this scientific package was reviewed by JOSS at https://joss.theoj.org/papers/10.21105/joss.03249#, and can be cited with:

# BibTeX
@article{Tran2021,
  doi = {10.21105/joss.03249},
  url = {https://doi.org/10.21105/joss.03249},
  year = {2021},
  publisher = {The Open Journal},
  volume = {6},
  number = {61},
  pages = {3249},
  author = {Nhat C. Tran and Jean X. Gao},
  title = {OpenOmics: A bioinformatics API to integrate multi-omics datasets and interface with public databases.},
  journal = {Journal of Open Source Software}
}

Credits

Thank you for extremely helpful feedback and guidance from the pyOpenSci reviewers. This package was created with the pyOpenSci/cookiecutter-pyopensci project template, based off audreyr/cookiecutter-pypackage.

Name		Name	Last commit message	Last commit date
Latest commit History 2,075 Commits
.github		.github
.idea		.idea
docs		docs
inst		inst
openomics		openomics
openomics_web		openomics_web
tests		tests
.deepsource.toml		.deepsource.toml
.editorconfig		.editorconfig
.gitignore		.gitignore
CITATION.cff		CITATION.cff
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Features

Installation:

PyPI

Conda

From source

Citations

Credits

About

Releases 4

Packages

Contributors 9

Languages

License

JonnyTran/OpenOmics

Folders and files

Latest commit

History

Repository files navigation

Features

Installation:

PyPI

Conda

From source

Citations

Credits

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 4

Packages 0

Contributors 9

Languages

Packages