4.8 Article

m5U-SVM: identification of RNA 5-methyluridine modification sites based on multi-view features of physicochemical features and distributed representation

Journal

BMC BIOLOGY
Volume 21, Issue 1, Pages -

Publisher

BMC
DOI: 10.1186/s12915-023-01596-0

Keywords

5-Methyluridine; Support vector machines; Multi-view feature; Word2Vec

Categories

Ask authors/readers for more resources

In this study, a novel predictor called m5U-SVM was developed to identify m5U modification sites from RNA sequences using multi-view features and machine learning algorithms. The optimized multi-view features were obtained from traditional physicochemical features and distributed representation features. The proposed model outperformed the existing state-of-the-art tool in terms of performance.
BackgroundRNA 5-methyluridine (m5U) modifications are obtained by methylation at the C-5 position of uridine catalyzed by pyrimidine methylation transferase, which is related to the development of human diseases. Accurate identification of m5U modification sites from RNA sequences can contribute to the understanding of their biological functions and the pathogenesis of related diseases. Compared to traditional experimental methods, computational methods developed based on machine learning with ease of use can identify modification sites from RNA sequences in an efficient and time-saving manner. Despite the good performance of these computational methods, there are some drawbacks and limitations.ResultsIn this study, we have developed a novel predictor, m5U-SVM, based on multi-view features and machine learning algorithms to construct predictive models for identifying m5U modification sites from RNA sequences. In this method, we used four traditional physicochemical features and distributed representation features. The optimized multi-view features were obtained from the four fused traditional physicochemical features by using the two-step LightGBM and IFS methods, and then the distributed representation features were fused with the optimized physicochemical features to obtain the new multi-view features. The best performing classifier, support vector machine, was identified by screening different machine learning algorithms. Compared with the results, the performance of the proposed model is better than that of the existing state-of-the-art tool.Conclusionsm5U-SVM provides an effective tool that successfully captures sequence-related attributes of modifications and can accurately predict m5U modification sites from RNA sequences. The identification of m5U modification sites helps to understand and delve into the related biological processes and functions.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available