Skip to content

Latest commit

 

History

History
67 lines (39 loc) · 3.3 KB

README.md

File metadata and controls

67 lines (39 loc) · 3.3 KB

CycPeptMP

License: MIT

About

  • Python implementation of CycPeptMP.
  • CycPeptMP is an accurate and efficient model for predicting the membrane permeability of cyclic peptides.
  • We designed features for cyclic peptides at the atom, monomer, and peptide levels to concurrently capture both the local sequence variations and global conformational changes in cyclic peptides. We also applied data augmentation techniques at three scales to enhance model training efficiency.

framework

Requirements

  • Python: 3.9.6
  • Numpy: 1.25.0
  • Pandas: 1.4.4
  • Pytorch: 2.0.0 (CUDA: 11.7)
  • RDKit: 2022.09.5
  • Mordred: 1.2.0
  • MOE: 2019.01 (commercial software)

Dataset

  • Original cyclic peptide structure (SMILES) and experimentally determined membrane permeability (LogPexp) used in this study (data/CycPeptMPDB_Peptide_All.csv) were all sourced from CycPeptMPDB.
  • Correspondence table of peptides and their constituent monomers is summarized in data/monomer_table.csv.
  • Data used in this experiment, with duplicates removed (all: 7,451->7,337, PAMPA: 6,941->6,889), is summarized in desc/peptide_used.csv.
  • Dataset split index is stored in data/eval_index/.

    *_ID.npy shows the CycPeptMPDB peptide ID, and *_index.npy shows the index in sorted desc/peptide_used.csv.

Code

  • Testset.ipynb

    Prediction for the test set (and other assay data) shown in the paper by CycPeptMP and other baselines. Please download complete input files model/input/Trans/60/ from Google Drive.

  • Newdata.ipynb

    Prediction for new data.

Pretrained weights

  • Weights of CycPeptMP (60 times augmentation) for three validation runs (Fusion-60_cv*.cpt).
  • Weights of fusion model with no augmentation (Fusion-1_cv*.cpt) and 20 times augmentation (Fusion-20_cv*.cpt) for three validation runs in ablation studies.

Reference

Contact