4.5 Article

Accurate prediction of species-specific 2-hydroxyisobutyrylation sites based on machine learning frameworks

期刊

ANALYTICAL BIOCHEMISTRY
卷 602, 期 -, 页码 -

出版社

ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.ab.2020.113793

关键词

Lysine 2-hydroxyisobutyrylation; Machine learning; Support vector machine; Random forest; Post-translational modification

资金

  1. National Natural Science Foundation of China [21665016, 21175064]

向作者/读者索取更多资源

Lysine 2-hydroxyisobutyrylation (K-hib) is a newly discovered post-translational modification (PTM) across eukaryotes and prokaryotes in recent years, which plays a significant role in diverse cellular functions. Accurate prediction of K-hib sites is a first-crucial step to decipher its molecular mechanism and urgently needed. In this work, based on a large benchmark datasets in multi-species, a novel online species-specific prediction tool, namely KhibPred, was developed to identify K-hib sites. Four types of feature strategies, including sequence-based information, physicochemical properties and evolutionary-derived information, were applied to represent a wide range of protein sequences, and the random forest was used to build the optimal feature datasets. Moreover, six representative machine learning (ML) methods were trained and comprehensively discussed and compared for each organism. Data analyses suggested that the unique protein sequence preferences were discovered for each species. When evaluated on independent test datasets, the area under the receiver operating characteristic curves (AUCs) achieved 0.807, 0.781, 0.825 and 0.831 for Saccharomyces cerevisiaes, Physcomitrella patens, Rice Seeds and HeLa cells, respectively. The satisfactory results imply that KhibPred is a promising computational tool. The online predictor can be freely available at http://bioinfo.ncu.edu.cn/KhibPred.aspx.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据