EnQA

A 3D-equivariant neural network for protein structure accuracy estimation

usage: python3 EnQA.py [-h] --input INPUT --output OUTPUT --method METHOD [--cpu] [--alphafold_prediction ALPHAFOLD_PREDICTION] [--alphafold_feature_cache ALPHAFOLD_FEATURE_CACHE] [--af2_pdb AF2_PDB]

Predict model quality and write the result in NumPy array format.

optional arguments:
  -h, --help                  Show this help message and exit
  --input INPUT               Path to input pdb file.
  --output OUTPUT             Path to output folder.
  --method METHOD             Prediction method: "ensemble", "EGNN_Full", "se3_Full", "EGNN_esto9" or "EGNN_covariance". An ensemble can be run by listing multiple models separated by commas.
  --alphafold_prediction      Path to AlphaFold prediction results.
  --alphafold_feature_cache   Optional. Cache AlphaFold features for reuse across models of the same sequence.
  --af2_pdb AF2_PDB           Optional. PDBs from the AlphaFold prediction, used for index correction when the input PDB contains only part of the sequence in the AlphaFold results.
  --cpu                       Optional. Force to use CPU.
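
The options above can also be assembled programmatically, which helps when scoring many models in a loop. The following is a minimal sketch, not part of EnQA: build_enqa_command is a hypothetical helper, the method names are copied from the --method help text, and it assumes that an ensemble is requested by passing a comma-separated list to --method.

```python
import shlex

# Method names copied from the --method help text.
KNOWN_METHODS = {"ensemble", "EGNN_Full", "se3_Full", "EGNN_esto9", "EGNN_covariance"}


def build_enqa_command(input_pdb, output_dir, method,
                       alphafold_prediction=None, use_cpu=False):
    """Return the argument list for one EnQA.py run (hypothetical helper)."""
    # Assumption: several models are combined by a comma-separated --method value.
    names = [name.strip() for name in method.split(",")]
    unknown = [name for name in names if name not in KNOWN_METHODS]
    if unknown:
        raise ValueError(f"unknown method(s): {unknown}")

    cmd = ["python3", "EnQA.py",
           "--input", str(input_pdb),
           "--output", str(output_dir),
           "--method", ",".join(names)]
    if alphafold_prediction is not None:
        cmd += ["--alphafold_prediction", str(alphafold_prediction)]
    if use_cpu:
        cmd.append("--cpu")
    return cmd


example_cmd = build_enqa_command(
    "example/model/6KYTP/test_model.pdb",
    "outputs/",
    "EGNN_Full",
    alphafold_prediction="example/alphafold_prediction/6KYTP/",
)
# Print a shell-quoted command line that can be inspected before running it.
print(shlex.join(example_cmd))
```

The returned list can be handed to subprocess.run(cmd, check=True) from the repository root.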

Requirements:

biopandas==0.2.9

biopython==1.79

numpy==1.21.3

pandas==1.3.4

scipy==1.7.1

torch==1.10.0

equivariant_attention (Optional, used by models based on SE(3)-Transformer only)

pdb-tools (Optional, used by models with multiple chains only)
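
Before running EnQA, it can be useful to confirm that the installed packages match the pinned versions listed above. This is a small sketch using only the standard library; check_pins is a hypothetical helper, and the distribution names are assumed to be the same as the names in the list.

```python
from importlib import metadata

# Versions copied from the requirements list.
PINNED = {
    "biopandas": "0.2.9",
    "biopython": "1.79",
    "numpy": "1.21.3",
    "pandas": "1.3.4",
    "scipy": "1.7.1",
    "torch": "1.10.0",
}


def check_pins(pins=PINNED):
    """Map each package to (wanted, found, matches); found is None if missing."""
    report = {}
    for name, wanted in pins.items():
        try:
            found = metadata.version(name)
        except metadata.PackageNotFoundError:
            found = None
        # Ignore local build tags such as "+cu113" when comparing versions.
        matches = found is not None and found.split("+")[0] == wanted
        report[name] = (wanted, found, matches)
    return report


for name, (wanted, found, ok) in check_pins().items():
    status = "ok" if ok else "MISMATCH"
    print(f"{name}: wanted {wanted}, found {found} [{status}]")
```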

You may also need to grant execute permission to utils/lddt and to the files under utils/SGCN/bin.
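
As a sketch of the permission step, the snippet below adds the execute bit to utils/lddt and to every regular file directly under utils/SGCN/bin, which is equivalent to running chmod +x on them. grant_enqa_permissions is a hypothetical helper, and repo_root is assumed to point at the repository checkout.

```python
import stat
from pathlib import Path


def make_executable(path):
    """Add execute permission for user, group and others, keeping other bits."""
    path = Path(path)
    path.chmod(path.stat().st_mode | stat.S_IXUSR | stat.S_IXGRP | stat.S_IXOTH)


def grant_enqa_permissions(repo_root="."):
    """Mark utils/lddt and the files in utils/SGCN/bin executable; return them."""
    root = Path(repo_root)
    targets = [root / "utils" / "lddt"]
    bin_dir = root / "utils" / "SGCN" / "bin"
    if bin_dir.is_dir():
        targets += sorted(p for p in bin_dir.iterdir() if p.is_file())

    changed = []
    for target in targets:
        if target.is_file():  # silently skip anything that is not present
            make_executable(target)
            changed.append(target)
    return changed
```

Calling grant_enqa_permissions(".") from the repository root returns the list of files that were updated.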

Note: Currently, the dependencies support AMD/Intel-based systems running Ubuntu 21.10 (Impish Indri). Other Linux-based systems may also work, but this is not guaranteed.

Example usages

Running an E(n)-equivariant model on the data in the example folder:

python3 EnQA.py --input example/model/6KYTP/test_model.pdb --output outputs/ --method EGNN_Full --alphafold_prediction example/alphafold_prediction/6KYTP/

To run models based on the SE(3)-Transformer, the Python package equivariant_attention is required and should be installed following Fabian's implementation.

Example:

python3 EnQA.py --input example/model/6KYTP/test_model.pdb --output outputs/ --method se3_Full --alphafold_prediction example/alphafold_prediction/6KYTP/
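
Because equivariant_attention is optional, a script can check for it before requesting an SE(3)-Transformer model. The sketch below is illustrative only: se3_available and pick_method are hypothetical helpers, and falling back to EGNN_Full is an assumption about what a caller might want, not behavior built into EnQA.

```python
import importlib.util


def se3_available(module_name="equivariant_attention"):
    """Return True if the given top-level module can be imported."""
    return importlib.util.find_spec(module_name) is not None


def pick_method(preferred="se3_Full", fallback="EGNN_Full",
                module_name="equivariant_attention"):
    """Use the preferred method unless it needs a module that is missing."""
    # Assumption: method names starting with "se3" need the SE(3)-Transformer package.
    if preferred.startswith("se3") and not se3_available(module_name):
        return fallback
    return preferred


print("method to use:", pick_method())
```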

Feature generation using featurizers from Spherical graph convolutional networks

The featurizers from Spherical graph convolutional networks (S-GCN) are used to process 3D models of proteins represented as molecular graphs. Here we provide the voronota and spherical harmonics featurizers for Linux.

If you need to rebuild voronota for a different system, please check out the S-GCN Repo.

Featurizer binaries built for other systems are also available (currently, only macOS and Linux are supported).

Generating AlphaFold2 models for assisted quality assessment

To generate models with AlphaFold2, an installation of AlphaFold2 following its Official Repo is required. In our experiments, we use the original model used at CASP14 with no ensembling (--model_preset=monomer) and all genetic databases used at CASP14 (--db_preset=full_dbs), and we restrict templates to structures that were available at the start of CASP14 (--max_template_date=2020-05-14).
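
The three settings above can be collected into one argument list, as in the sketch below. Only --model_preset, --db_preset and --max_template_date come from this README. The script name run_alphafold.py and the --fasta_paths and --output_dir flags are assumptions based on the AlphaFold repository, alphafold_args is a hypothetical helper, and a real run also needs the database and data directory options described in the AlphaFold documentation.

```python
def alphafold_args(fasta_path, output_dir):
    """Build an AlphaFold2 argument list with the CASP14-style settings."""
    return [
        "python3", "run_alphafold.py",       # assumed entry point
        f"--fasta_paths={fasta_path}",       # assumed flag name
        f"--output_dir={output_dir}",        # assumed flag name
        "--model_preset=monomer",            # original CASP14 model, no ensembling
        "--db_preset=full_dbs",              # all genetic databases used at CASP14
        "--max_template_date=2020-05-14",    # templates available at the start of CASP14
    ]


# The target name and paths here are placeholders.
print(" ".join(alphafold_args("target.fasta", "alphafold_prediction/target/")))
```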
