The secondary structure of ribonucleic acid (RNA) is more stable and accessible in the cell than its tertiary structure, making it essential for functional prediction. Although deep learning has shown promising results in this field, current methods suffer from poor generalization and high complexity. In this work, we present RFold, a simple yet effective RNA secondary structure prediction in an end-to-end manner. RFold introduces a decoupled optimization process that decomposes the vanilla constraint satisfaction problem into row-wise and column-wise optimization, simplifying the solving process while guaranteeing the validity of the output. Moreover, RFold adopts attention maps as informative representations instead of designing hand-crafted features. Extensive experiments demonstrate that RFold achieves competitive performance and about eight times faster inference efficiency than the state-of-the-art method.
We show the overall RFold framework.
We comprehensively evaluate different results on the RNAStralign, ArchiveII datasets.
We provide a Colab demo for reproducing the results and testing RNA sequences by yourself:
If you are interested in our repository and our paper, please cite the following paper:
@inproceedings{tandeciphering,
title={Deciphering RNA Secondary Structure Prediction: A Probabilistic K-Rook Matching Perspective},
author={Tan, Cheng and Gao, Zhangyang and Hanqun, CAO and Chen, Xingran and Wang, Ge and Wu, Lirong and Xia, Jun and Zheng, Jiangbin and Li, Stan Z},
booktitle={Forty-first International Conference on Machine Learning}
}
If you have any issues about this work, please feel free to contact me by email:
- Cheng Tan: tancheng@westlake.edu.cn