tl;dr:
Click any launch binder
badge on this page to run command line-based BLAST inside your browser.
BLAST: Basic Local Alignment Search Tool (BLAST+) command-line applications.
This repository is for running BLAST+ in Jupyter environment provided by MyBinder.org.
Additionally, having BLAST+ working inside the Jupyter environment with interactive Python adds some convenient features that are illustrated. A utility script for moving command line-based BLAST results into Python is also demonstrated.
The BLAST software will be installed already in each active session launched from this repository. The BLAST software is available directly from the National Library of Medicine under BLAST+ executables.
BLAST is a registered trademark of the National Library of Medicine, see here.
The BLAST software references are listed here.
Users of BLAST+ here should probably cite:
-
BLAST+: architecture and applications. Camacho C., Coulouris G., Avagyan V., Ma N., Papadopoulos J., Bealer K., & Madden T.L. (2008) BMC Bioinformatics 10:421. PMID:20003500
-
BLAST: a more efficient report with usability improvements. Boratyn GM, Camacho C, Cooper PS, Coulouris G, Fong A, Ma N, Madden TL, Matten WT, McGinnis SD, Merezhuk Y, Raytselis Y, Sayers EW, Tao T, Ye J, & Zaretskaya I. (2013) Nucleic Acids Res. 41:W29-W33.PMID: 23609542
Clarifying Software Attribution: I, Wayne, am not involved in the BLAST software at all. Those at the National Library of Medicine are the developers and distributors of BLAST. See their materials. I simply set up this repository to make the software useable on the command line without installation headaches.
I, Wayne, did code a Python-based utility for use with the results from command line BLAST; it is available here and utilized in the notebooks in this repository to process the results and allow easily converting the results to other forms.
This repository is set up to allow running the command line version of BLAST software after pressing the launch binder
button above or below. The target use case is when you want to run custom BLAST or process a lot of sequences.
In the notebooks that can be launched, I have added some examples illustrating how to use the program and process the results easily with Python and convert to other forms. Additionally, useful resources for using command line BLAST are in those notebooks. Alternatively, the notebook with most of resources can be viewed statically here. The 'Credits/Resources' section right at the top is a good place to start.
The Binder-luanchable version too limiting for your needs?
NCBI has made available an Amazon Machine Image (AMI) that will generate a remote cloud compute instance already set to run standalone BLAST, see here. Additional documentation is here.
Obviously, web-based BLAST is also available. Here are some of my favorite web-based BLAST resources:
- NCBI's main BLAST page
- Saccharomyces Genome Database web interface
- SequenceServer lets you set up a BLAST+ server for individual use or for sharing datasets and lists several community databases that use it for a querying mechanism.
This repository is set up to make use of the binder service offered by MyBinder.org. See their site for more information about Binder.
Developed mainly from combining much of the binderized repo from here with the ability to also run PatMatch, see here for information about PatMatch and launchable Jupyter notebooks demonstrating its use.
I borrrowed the 'warning' highlight/introductory text about notebooks at the top of the included notebook from Tim Sherratt's notebook here.
Click this button below to begin using BLAST (or PatMatch, as well):