4.7 Article

Protein-Protein Interaction Sites Prediction Using Batch Normalization Based CNNs and Oversampling Method Borderline-SMOTE

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2023.3238001

关键词

Proteins; Protein engineering; Feature extraction; Training; Random forests; Biological system modeling; Predictive models; Batch normalization; borderline-SMOTE; convolutional neural networks (CNNs); protein-protein interaction sites; sample imbalance

向作者/读者索取更多资源

In this study, a novel model combining convolutional neural networks (CNNs) with Batch Normalization is designed to predict protein-protein interaction sites (PPIs), using an oversampling technique Borderline-SMOTE to address the problem of sample imbalance. The effectiveness of the method is validated by comparing it with existing state-of-the-art schemes, and achieved improved accuracies on three public datasets.
The recognition of protein-protein interaction sites (PPIs) is beneficial for the interpretation of protein functions and the development of new drugs. Traditional biological experiments to identify PPI sites are expensive and inefficient, leading to the generation of various computational methods to predict PPIs. However, the accurate prediction of PPI sites remains a big challenge due to the existence of the sample imbalance issue. In this work, we design a novel model that combines convolutional neural networks (CNNs) with Batch Normalization to predict PPI sites, and employ an oversampling technique Borderline-SMOTE to address the sample imbalance issue. In particular, to better characterize the amino acid residues on the protein chains, we employ a sliding window approach for feature extraction of target residues and their contextual residues. We verify the effectiveness of our method by comparing our method with the existing state-of-the-art schemes. The performance validations of our method on three public datasets achieve accuracies of 88.6%, 89.9%, and 86.7%, respectively, all showing improved accuracies compared with the existing schemes. Moreover, the ablation experiment results suggest that Batch Normalization can greatly improve the generalization and the prediction stability of our model.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据