The backbone of most proteins forms an open curve. To study their entanglement, a common strategy consists in searching for the presence of knots in their backbones using topological invariants. However, this approach requires to close the curve into a loop, which alters its geometry. Knoto-ID allows evaluating the entanglement of open curves without the need to close them, using the recent concept of knotoids which is a generalization of classical knot theory to open curves. Knoto-ID can analyse the global topology of the full chain as well as the local topology by exhaustively studying all subchains or only determining the knotted core. The use of Knoto-ID is not limited to proteins, it can be used to analyse any open curve in 3D space such as chromosomes, synthetic polymers, random walks, etc.
If you use this software for a publication, please cite:
J. Dorier, D. Goundaroulis, F. Benedetti and A. Stasiak, "Knoto-ID: a tool to study the entanglement of open protein chains using the concept of knotoids", Bioinformatics 34, 3402-3404 (2018).
If you use the knotoid classification from the internal database
(using --names-db=internal
) or from files
examples/knotoid_names_sphere.txt
, examples/knotoid_names_planar.txt
,
examples/knotoid_names_sphere_arrow.txt
,
examples/knotoid_names_planar_arrow.txt
, please cite:
D. Goundaroulis, J. Dorier and A. Stasiak, "A systematic classification of knotoids on the plane and on the sphere", arXiv:1902.07277 [math.GT].
Precompiled binary distributions, with a user guide and examples, are available for Linux, Mac OS X and Windows at https://github.com/sib-swiss/Knoto-ID/releases/latest.
Knoto-ID is a collection of command line tools to study knot and knotoid diagrams on the sphere and on the plane. Its main features are:
- Accept protein structures in Protein Data Bank (PDB) format, 3D curves in xyz format, knot and knotoid diagrams in extended Gauss code format or PD code format.
- Draw knot(oid) diagrams.
- Evaluate the following polynomial invariant: classical Jones polynomial for knots (closed curves), the Jones polynomial for knotoids (open curves projected on a sphere), the Turaev loop bracket for knotoids (open curves projected on a plane), the arrow polynomial (open curves projected on a sphere) and the loop arrow polynomial (open curves projected on a plane).
- Evaluate polynomial invariant for multiple projection direction and produce projection maps on the plane of spherical coordinates or directly on a 3D globe (in interactive webGL format). with the 3D curve used as input.
- Find knotted cores of 3D curves.
- Output a 3D curve (in interactive webGL format) and highlight its knotted core.
- Generate fingerprint matrices (for open curves) and disk matrices (for closed curves) to summarize entanglement of all subchains of a 3D curve.
Please read the user guide distributed with Knoto-ID for more information.
Dependencies:
- c++ compiler (e.g. g++)
- cmake
- boost (boost-graph, boost-random and boost-regex)
- bash (for
make test
) - For the documentation:
- latex with pdflatex, bibtex and packages adjustbox, tikz.
To compile:
mkdir build
cd build/
cmake -DCMAKE_INSTALL_PREFIX=install_path/ -DCMAKE_BUILD_TYPE=Release ..
make
make doc
make test
make install
- Mac OS X 10.12.6: Apple LLVM 9.0.0 compiler, cmake 3.9.4 from Homebrew, boost 1.65.1 from Homebrew.
- Mac OS X 10.13.3: Apple LLVM 9.0.0 compiler, cmake 3.10.2 from MacPorts, boost 1.66.0 from MacPorts.
- CentOS 6: g++ 4.4.7 compiler, cmake 2.8.12, boost 1.41.
- Ubuntu 16.04.3 LTS: g++ 5.4.0 compiler, cmake 3.5.1, boost 1.58.
- Fedora 25: g++ 6.4.1, cmake 3.9.1 boost 1.60.
- Fedora 25: MinGW Windows cross-compiler 6.4.0, cmake 3.9.1, boost 1.60.
Detailed usage instructions can be found in the user guide
install_path/doc/user_guide.pdf
generated during compilation.
Knoto-ID includes five scripts to convert pdb to xyz format, plot a 3D curve in webGL format, generate plots of projection maps, fingerprint matrices and disk matrices, and plot knot(oid) diagrams. To use these scripts, R (version>=3.1) must be installed along with the following packages:
- optparse.
- ggplot2 (version>=2.2.0).
- RColorBrewer.
- reshape2.
- geometry.
- geosphere.
- rgl.
- rmarkdown.
- Rpdb.
These package can installed by entering the following command in R:
install.packages(c("optparse","ggplot2","RColorBrewer","reshape2","geometry","geosphere","rgl","rmarkdown","Rpdb"))
To output in webGL format, pandoc must also be installed.
- Mac OS X 10.12.6: R 3.3.2, optparse 1.4.4, ggplot2 2.2.1, RColorBrewer 1.1-2, reshape2 1.4.2, geometry 0.3-6, geosphere 1.5-7, rgl 0.98.1, rmarkdown 1.9, Rpdb 2.3, pandoc 2.1.1.
- Mac OS X 10.13.3: R 3.4.3, optparse 1.4.4, ggplot2 2.2.1, RColorBrewer 1.1-2, reshape2 1.4.3, geometry 0.3-6, geosphere 1.5-7, rgl 0.99.9, rmarkdown 1.8, Rpdb 2.3, pandoc 1.12.4.2.
- Ubuntu 16.04.3 LTS: R 3.2.3, optparse 1.4.4, ggplot2 2.2.1, RColorBrewer 1.1-2, reshape2 1.4.3, geometry 0.3-6, geosphere 1.5-7, rgl 0.99.9, rmarkdown 1.9, Rpdb 2.3, pandoc 1.16.0.2.
- Fedora 25: R 3.4.2, optparse 1.4.4, ggplot2 2.2.1, RColorBrewer 1.1-2, reshape2 1.4.3, geometry 0.3-6, geosphere 1.5-7, rgl 0.99.9, rmarkdown 1.8, Rpdb 2.3, pandoc 1.17.0.3.
- Windows 10: R 3.4.3, optparse 1.4.4, ggplot2 2.2.1, RColorBrewer 1.1-2, reshape2 1.4.3, geometry 0.3-6, geosphere 1.5-8, rgl 0.99.9, rmarkdown 1.9, Rpdb 2.3, pandoc 2.1.2.
- Julien Dorier
- Dimos Goundaroulis
- Vital-IT, SIB Swiss Institute of Bioinformatics
- Center for Integrative Genomics, University of Lausanne
The implementation of the circle packing algorithm used to generate xyz coordinates for a knot(oid) diagram is based on the python version by David Eppstein: CirclePack.py, licensed under the MIT License (http://opensource.org/licenses/MIT).
Copyright (C) 2017 by SIB Swiss Institute of Bioinformatics, Julien Dorier and Dimos Goundaroulis.
This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 2 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program. If not, see http://www.gnu.org/licenses/.