Enhancing Conformational Sampling for Intrinsically Disordered and Ordered Proteins by Variational Autoencoder
Overview of VAE: Variational Autoencoder (VAE) generates similar conformations based on a given MD trajectory, thus can be used to enhance the sampling of the diverse conformation for intrinsically disordered proteins (IDPs) and structural proteins.
- Compatible with Python 3.x.
- Dependencies can be installed using the
requirements.txt
file.
- Install all the requirements by executing
pip install -r requirements.txt.
- Install required protein
.pdb
processing librarybiobox
from this github repository. - Next execute
python preprocess.py pdb split
which aligns the conformations from inputpdb
and creates VAE-required dataset at certainsplit
. - To start a training run:
python packed_vae.py pdb
where pdb
denotes the filename of input MD trajectory.
- To calculate the RMSD between generated conformations and original ones:
python rmsd_min.py pdb
To conduct cluster analysis on generated conformations, please use the scripts in k-cluster
folder.
- Cluster analysis requires
MMTSB
tool set in this research, install the tool set from this website. - execute
list.pl
to get file list for further analysis. - execute
run.sh
to conduct cluster analysis.
To calculate secondary chemical shift of generated conformations, please use the scriptes in chemical-shift
folder.
- Calculation requires
SPARTA+
program from this website. - execute
split_pdb.py
to calculate chemical shift. - execute
avg_cs.py
to get average chemical shift of target conformation ensemble.