4.6 Article

A Framework for Identifying Essential Proteins with Hybridizing Deep Neural Network and Ordinary Least Squares

期刊

APPLIED SCIENCES-BASEL
卷 13, 期 15, 页码 -

出版社

MDPI
DOI: 10.3390/app13158613

关键词

essential proteins; deep neural network; ordinary least squares; protein-protein interaction network

向作者/读者索取更多资源

Identifying essential proteins is crucial for understanding cellular requirements, discovering pathogenic genes, and diagnosing diseases. The integration of protein-protein interaction networks and biological sequence features enhances the accuracy of essential protein identification. A deep neural network method named IYEPDNN was used in this study, achieving a high accuracy of 84% and outperforming other state-of-the-art methods.
Essential proteins are vital for maintaining life activities and play a crucial role in biological processes. Identifying essential proteins is of utmost importance as it helps in understanding the minimal requirements for cell life, discovering pathogenic genes and drug targets, diagnosing diseases, and comprehending the mechanism of biological evolution. The latest research suggests that integrating protein-protein interaction (PPI) networks and relevant biological sequence features can enhance the accuracy and robustness of essential protein identification. In this paper, a deep neural network (DNN) method was used to identify a yeast essential protein, which was named IYEPDNN. The method combines gene expression profiles, PPI networks, and orthology as input features to improve the accuracy of DNN while reducing computational complexity. To enhance the robustness of the yeast dataset, the common least squares method is used to supplement absenting data. The correctness and effectiveness of the IYEPDNN method are verified using the DIP and GAVIN databases. Our experimental results demonstrate that IYEPDNN achieves an accuracy of 84%, and it outperforms state-of-the-art methods (WDC, PeC, OGN, ETBUPPI, RWAMVL, etc.) in terms of the number of essential proteins identified. The findings of this study demonstrate that the correlation between features plays a crucial role in enhancing the accuracy of essential protein prediction. Additionally, selecting the appropriate training data can effectively address the issue of imbalanced training data in essential protein identification.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据