4.7 Article

iRNA-ac4C: A novel computational method for effectively detecting N4-acetylcytidine sites in human mRNA

Journal

INTERNATIONAL JOURNAL OF BIOLOGICAL MACROMOLECULES
Volume 227, Issue -, Pages 1174-1181

Publisher

ELSEVIER
DOI: 10.1016/j.ijbiomac.2022.11.299

Keywords

N4-acetylcytidine; Feature selection; Gradient boosting decision tree; Machine learning

Ask authors/readers for more resources

A novel predictor, iRNA-ac4C, was developed to identify ac4C sites in human mRNA using three feature extraction methods. The results showed promising generalization capabilities.
RNA N4-acetylcytidine (ac4C) is the acetylation of cytidine at the nitrogen-4 position, which is a highly conserved RNA modification and involves a variety of biological processes. Hence, accurate identification of genome-wide ac4C sites is vital for understanding regulation mechanism of gene expression. In this work, a novel predictor, named iRNA-ac4C, was established to identify ac4C sites in human mRNA based on three feature extraction methods, including nucleotide composition, nucleotide chemical property, and accumulated nucleo-tide frequency. Subsequently, minimum-Redundancy-Maximum-Relevance combined with incremental feature selection strategies was utilized to select the optimal feature subset. According to the optimal feature subset, the best ac4C classification model was trained by gradient boosting decision tree with 10-fold cross-validation. The results of independent testing set indicated that our proposed method could produce encouraging generalization capabilities. For the convenience of other researchers, we established a user-friendly web server which is freely available at http://lin-group.cn/server/iRNA-ac4C/. We hope that the tool could provide guide for wet-experimental scholars.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available