Antimicrobial resistance is a growing health concern. Antimicrobial peptides (AMPs) disrupt harmful microorganisms by non-specific mechanisms, making it difficult for microbes to develop resistance. Accordingly, they are promising alternatives to traditional antimicrobial drugs. In this study, we developed an improved AMP classification model, called AMP-BERT. We propose a deep learning model with a fine-tuned BERT architecture designed to extract structural/functional information from input peptides and identify each input as AMP or non-AMP. We compared the performance of our proposed model and other machine learning-based and deep learning-based methods. Our model, AMP-BERT, yielded the best prediction results among all models evaluated with our curated external dataset. In addition, we utilized the attention mechanism in BERT to implement an interpretable feature analysis and determine the specific residues in known AMPs that contribute to peptide structure and antimicrobial function. The results show that AMP-BERT can capture the structural properties of peptides for model learning, enabling the prediction of AMPs or non-AMPs from input sequences. AMP-BERT is expected to contribute to the identification of candidate AMPs for functional validation and drug development.
The fine-tuned AMP-BERT model can be reproduced and implemented for the AMP prediction downstream task in this Colab notebook.
The pre-trained model is based on ProtBERT-BFD from Elnaggar et al., 2021.
AMP-BERT follows GPL 3.0v license. Therefore, AMP-BERT is open source and free to use for everyone.
Hansol Lee (hansol.lee@gist.ac.kr)
Hojung Nam* (hjnam@gist.ac.kr)
*Corresponding Author