4.7 Article

THRONE: A New Approach for Accurate Prediction of Human RNA N7-Methyl-guanosine Sites

Journal

JOURNAL OF MOLECULAR BIOLOGY
Volume 434, Issue 11, Pages -

Publisher

ACADEMIC PRESS LTD- ELSEVIER SCIENCE LTD
DOI: 10.1016/j.jmb.2022.167549

Keywords

-

Funding

  1. National Research Foundation of Korea (NRF) - Korean government (MSIT) [2021R1A2C1014338, 2020R1A4A4079722]
  2. National Research Foundation of Korea [2020R1A4A4079722] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Ask authors/readers for more resources

In this study, a novel predictor called THRONE was developed to accurately identify m7G sites in the human genome. THRONE utilizes multiple sequence-based features and machine learning classifiers, and combines multiple models through ensemble learning. The proposed method outperformed existing methods in predicting m7G sites.
N-7-methylguanosine (m7G) is an essential, ubiquitous, and positively charged modification at the 50 cap of eukaryotic mRNA, modulating its export, translation, and splicing processes. Although several machine learning (ML)-based computational predictors for m7G have been developed, all utilized specific computational framework. This study is the first instance we explored four different computational frameworks and identified the best approach. Based on that we developed a novel predictor, THRONE (A three layer ensemble predictor for identifying human RNA N-7-methylguanosine sites) to accurately identify m7G sites from the human genome. THRONE employs a wide range of sequence-based features inputted to several ML classifiers and combines these models through ensemble learning. The three step ensemble learning is as follows: 54 baseline models were constructed in the first layer and the predicted probability of m7G was considered as a new feature vector for the sequential step. Subsequently, six meta-models were created using the new feature vector and their predicted probability was yet again considered as novel features. Finally, random forest was deemed as the best super classifier learner for the final prediction using a systematic approach incorporated with novel features. Interestingly, THRONE outperformed other existing methods in the prediction of m7G sites on both cross-validation analysis and independent evaluation. The proposed method is publicly accessible at: http://thegleelab.org/THRONE/ and expects to help the scientific community identify the putative m7G sites and formulate a novel testable biological hypothesis. (C) 2022 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available