4.7 Article

BERMP: a cross-species classifier for predicting m(6)A sites by integrating a deep learning algorithm and a random forest approach

期刊

INTERNATIONAL JOURNAL OF BIOLOGICAL SCIENCES
卷 14, 期 12, 页码 1669-1677

出版社

IVYSPRING INT PUBL
DOI: 10.7150/ijbs.27819

关键词

Deep learning; Recurrent neural network; bidirectional Gated Recurrent Unit; N-6-methyladenosine; Random forest

资金

  1. Young Scientists Fund of the National Natural Science Foundation of China [31701142, 81602621]
  2. Qingdao Postdoctoral Science Foundation [2016061]
  3. Shandong Provincial Natural Science Foundation [ZR2016CM14]
  4. National Natural Science Foundation of China [31770821]

向作者/读者索取更多资源

N-6-methyladenosine (m(6)A) is a prevalent RNA methylation modification involved in several biological processes. Hundreds or thousands of m(6)A sites identified from different species using high-throughput experiments provides a rich resource to construct in-silico approaches for identifying m(6)A sites. The existing m(6)A predictors are developed using conventional machine-learning (ML) algorithms and most are species-centric. In this paper, we develop a novel cross-species deep-learning classifier based on bidirectional Gated Recurrent Unit (BGRU) for the prediction of m(6)A sites. In comparison with conventional ML approaches, BGRU achieves outstanding performance for the Mammalia dataset that contains over fifty thousand m(6)A sites but inferior for the Saccharomyces cerevisiae dataset that covers around a thousand positives. The accuracy of BGRU is sensitive to the data size and the sensitivity is compensated by the integration of a random forest classifier with a novel encoding of enhanced nucleic acid content. The integrated approach dubbed as BGRU-based Ensemble RNA Methylation site Predictor (BERMP) has competitive performance in both cross-validation test and independent test. BERMP also outperforms existing m(6)A predictors for different species. Therefore, BERMP is a novel multi-species tool for identifying m(6)A sites with high confidence.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据