
AlphaMut

This is the GitHub Repository accompanying the paper: "AlphaMut: a deep reinforcement learning model to suggest helix-disrupting mutations"


Running the Inference Model

To run the trained Helix-in-protein model, use the Colab notebook 3_inference_of_Helix-in-protein_trained_model.ipynb. Instructions, along with an illustrative example, are provided in the notebook.

Training the Model

This repository contains the code for learning to disrupt helices using reinforcement learning. There are two models: one that disrupts isolated helices (Helix-only) and another that disrupts helices within a protein environment (Helix-in-protein).

Information on training is provided in the Jupyter notebooks 1_training_and_validation_only_helix.ipynb and 2_training_and_validation_with_protein.ipynb.
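The full training pipeline lives in the notebooks above. As a rough illustration of the underlying setup (the state is a peptide sequence, an action is a point mutation, and the reward measures loss of helicity), here is a minimal sketch in plain Python. All names here are hypothetical for illustration; they are not the repository's API:

```python
# Illustrative sketch only: the real environment, embeddings, and reward
# (P-SEA secondary-structure assignment on an ESMFold-predicted structure)
# live in the repository's notebooks and utils/. All names are hypothetical.

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def apply_mutation(sequence: str, position: int, new_residue: str) -> str:
    """Return the sequence with a single point mutation applied."""
    if new_residue not in AMINO_ACIDS:
        raise ValueError(f"unknown residue: {new_residue}")
    if not 0 <= position < len(sequence):
        raise IndexError("mutation position outside the sequence")
    return sequence[:position] + new_residue + sequence[position + 1:]


def decode_action(action: int, seq_len: int) -> tuple[int, str]:
    """Map a flat action index to (position, residue) -- one common way to
    define a discrete action space of size seq_len * 20."""
    position, residue_idx = divmod(action, len(AMINO_ACIDS))
    if position >= seq_len:
        raise IndexError("action index outside the action space")
    return position, AMINO_ACIDS[residue_idx]


# Example: a poly-alanine helix receives a proline substitution (a classic
# helix breaker) at position 5.
helix = "AAAAAAAAAA"
pos, res = decode_action(5 * 20 + AMINO_ACIDS.index("P"), len(helix))
mutant = apply_mutation(helix, pos, res)
print(mutant)  # AAAAAPAAAA
```

In the actual models, the agent would score such mutants by predicting the mutant structure and rewarding loss of helical content.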

Packages Required

It is advised to install all of the packages below in a conda environment (Python >= 3.8). StableBaselines3 is recommended because it provides standard, ready-to-use implementations of RL algorithms; it also installs Gymnasium, which is needed for the reinforcement learning environment.

The following packages are required:

  • Biotite - used to obtain protein secondary-structure assignments (via P-SEA)1 for computing the reward.
  • Transformers - provides the ESMFold2 model and the ESM embedding model.
  • biopandas - used to read the initial PDB files.
  • StableBaselines3 - provides the RL algorithms.
  • BioVec - used to embed the states. States are protein sequences embedded in a 100-dimensional space using a pretrained model called ProtVec3; alternatively, the state can be obtained through the ESM-2 model, which gives a 320-dimensional embedding. Both are implemented in utils/encoder_decoder.py. The module used for ProtVec is biovec, implemented in this GitHub Repo; please pay attention to this issue. If you use the ESM model, there should be no installation issues, since ESM is implemented in transformers. This package is required only if you plan to train the Helix-only model.
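Given the list above, an environment could be set up roughly as follows. This is a sketch, not a tested recipe: the repository does not pin versions, and the package names are the usual PyPI ones:

```shell
# Create and activate a fresh environment (Python >= 3.8 as advised above)
conda create -n alphamut python=3.10 -y
conda activate alphamut

# Core dependencies; stable-baselines3 installs gymnasium as a dependency
pip install stable-baselines3 transformers biotite biopandas

# Only needed if you plan to train the Helix-only model (ProtVec embeddings);
# check the biovec installation issue linked above first
pip install biovec
```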

References

Footnotes

  1. P-SEA: a new efficient assignment of secondary structure from C-alpha trace of proteins. (PubMed)

  2. Evolutionary-scale prediction of atomic-level protein structure with a language model. Science.

  3. Continuous Distributed Representation of Biological Sequences for Deep Proteomics and Genomics. PLOS ONE.
