4.7 Article

LightGBM-PPI: Predicting protein-protein interactions through LightGBM with multi-information fusion

期刊

出版社

ELSEVIER
DOI: 10.1016/j.chemolab.2019.06.003

关键词

Protein-protein interactions; Multi-information fusion; Elastic net; LightGBM

资金

  1. National Natural Science Foundation of China [61863010]
  2. Natural Science Foundation of Shandong Province of China [ZR2017MA014, ZR2018MC007]
  3. Project of Shandong Province Higher Educational Science and Technology Program [J17KA159]
  4. Scientific Research Fund of Hunan Provincial Key Laboratory of Mathematical Modeling and Analysis in Engineering [2018MMAEZD10]
  5. National Science Foundation [ACI-1548562]

向作者/读者索取更多资源

Protein-protein interactions (PPIs) play an important role in cell life activities such as transcriptional regulation, signal transduction and drug signal transduction. The study of PPIs has become a research hotspot in bioinformatics. However, the identification of PPIs using experimental methods is time-consuming and costly. PPIs prediction based on machine learning is very important. This paper proposes a new protein-protein interactions prediction method called LightGBM-PPI. First, pseudo amino acid composition, autocorrelation descriptor, local descriptor, conjoint triad are employed to extract feature information. Secondly, we use the elastic net to select the optimal feature subset and eliminate redundant features. Finally, the LightGBM is employed as the classifier to predict PPIs and the LightGBM-PPI model is built up. Five-fold cross-validation shows that the prediction accuracy of the Helicobacter pylori and Saccharomyces cerevisiae datasets are 89.03% and 95.07%, respectively. The prediction accuracy of Caenorhabditis elegans, Escherichia coli, Homo sapiens and Mus musculus are 90.16%, 92.16%, 94.83% and 94.57%, respectively, which are superior to the state-of-the-art prediction methods. To further evaluate the advantages and disadvantages of the model, we use one-core network and the crossover network for the Wnt-related pathway to predict PPIs, which can provide new ideas for drug design and disease prevention. The source code and all datasets are available at https://github.com/QUST-AIBBDRC/LightGBM-PPI/.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据