4.7 Article

RFhy-m2G: Identification of RNA N2-methylguanosine modification sites based on random forest and hybrid features

Journal

METHODS
Volume 203, Issue -, Pages 32-39

Publisher

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.ymeth.2021.05.016

Keywords

N2-methylguanosine; Random forest; MRMD2.0; Hybrid feature; Feature analysis

Funding

  1. Natural Science Foundation of China [62072353, 61922020, 61672406]
  2. Fundamental Research Funds for the Central Universities [JB180307]

Ask authors/readers for more resources

A novel predictor, RFhy-m2G, was developed in this study to identify m2G modification sites using hybrid features and random forest. The predictor achieved high accuracies through feature fusion and optimal feature selection.
N2-methylguanosine is a post-transcriptional modification of RNA that is found in eukaryotes and archaea. The biological function of m2G modification discovered so far is to control and stabilize the three-dimensional structure of tRNA and the dynamic barrier of reverse transcription. To discover additional biological functions of m2G, it is necessary to develop time-saving and labor-saving calculation tools to identify m2G. In this paper, based on hybrid features and a random forest, a novel predictor, RFhy-m2G, was developed to identify the m2G modification sites for three species. The hybrid feature used by the predictor is used to fuse the three features of ENAC, PseDNC, and NPPS. These three features include primary sequence derivation properties, physicochemical properties, and position-specific properties. Since there are redundant features in hybrid features, MRMD2.0 is used for optimal feature selection. Through feature analysis, it is found that the optimal hybrid features obtained still contain three kinds of properties, and the hybrid features can more accurately identify m2G modification sites and improve prediction performance. Based on five-fold cross-validation and independent testing to evaluate the prediction model, the accuracies obtained were 0.9982 and 0.9417, respectively. The robustness of the predictor is demonstrated by comparisons with other predictors.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available