Skip to content

Rapid discovery of novel prophages using biological feature engineering and machine learning

License

Notifications You must be signed in to change notification settings

ku-cbd/PhageBoost

Repository files navigation

PhageBoost

Rapid discovery of novel prophages using biological feature engineering and machine learning
TBA

Introduction

Predictions Prophage predictor based on gene features against a background.

Prophages are phages integrated into prokaryotic genomes that drive many aspects of bacterial biology. Their extreme diversity means they are challenging to detect using sequence similarity. We present a novel fast and generalizing machine learning method to facilitate novel phage discovery.

Publications

Sirén,K., Millard,A., Petersen,B., Gilbert,M.T.P., Clokie,M.R.J. and Sicheritz-Pontén,T. (2020) Rapid discovery of novel prophages using biological feature engineering and machine learning. 10.1101/2020.08.09.243022. https://www.biorxiv.org/content/10.1101/2020.08.09.243022v1.abstract

Getting Started

Installation

We have updated the base code to use XGBoost 2 and recent Python(3.11), BioPython and Pyrodigal. For now the git hub version should work. As we are working on PhageBoost version 2.0 the current PyPI and notebook version will maybe fixed.

from PyPI

conda create -y -n PhageBoost-env python=3.7 
conda activate PhageBoost-env
pip install PhageBoost 
PhageBoost -h

from GitHub

conda create -y -n PhageBoost-env python=3.11 # python 3.9 - 3.12 should work
conda activate PhageBoost-env 
git clone git@github.com:ku-cbd/PhageBoost.git 
cd PhageBoost/ 
python setup.py bdist_wheel
pip install --user . 
PhageBoost -h

CLI

PhageBoost -h
PhageBoost -f example/data/NC_000907.fasta.gz -o results

Notebooks

There are basic notebook examples in the notebooks/ These notebooks provide a way how to bring your own genecalls to PhageBoost. You can connect your PhageBoost kernel to your pre-existing Jupyter via ipykernel:

conda activate PhageBoost
pip install ipykernel
python -m ipykernel install --user --name PhageBoost --display-name "PhageBoost" 

About

Rapid discovery of novel prophages using biological feature engineering and machine learning

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 3

  •  
  •  
  •