4.8 Article

UFold: fast and accurate RNA secondary structure prediction with deep learning

期刊

NUCLEIC ACIDS RESEARCH
卷 50, 期 3, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/nar/gkab1074

关键词

-

资金

  1. NSF [DMS1763272, IIS-1715017]
  2. NIH [U54-CA217378]
  3. Simons Foundation [594598]

向作者/读者索取更多资源

UFold is a deep learning-based method for RNA secondary structure prediction, which uses a novel image-like representation of RNA sequences to achieve accurate predictions in a short time, outperforming previous methods on within-family datasets and showing similar performance on distinct RNA families.
For many RNA molecules, the secondary structure is essential for the correct function of the RNA. Predicting RNA secondary structure from nucleotide sequences is a long-standing problem in genomics, but the prediction performance has reached a plateau over time. Traditional RNA secondary structure prediction algorithms are primarily based on thermodynamic models through free energy minimization, which imposes strong prior assumptions and is slow to run. Here, we propose a deep learning-based method, called UFold, for RNA secondary structure prediction, trained directly on annotated data and base-pairing rules. UFold proposes a novel image-like representation of RNA sequences, which can be efficiently processed by Fully Convolutional Networks (FCNs). We benchmark the performance of UFold on both within- and cross-family RNA datasets. It significantly outperforms previous methods on within-family datasets, while achieving a similar performance as the traditional methods when trained and tested on distinct RNA families. UFold is also able to predict pseudoknots accurately. Its prediction is fast with an inference time of about 160 ms per sequence up to 1500 bp in length. An online web server running UFold is available at . Code is available at .

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据