4.7 Article

Protein-Protein Interaction Sites Prediction Using Batch Normalization Based CNNs and Oversampling Method Borderline-SMOTE

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TCBB.2023.3238001

Keywords

Proteins; Protein engineering; Feature extraction; Training; Random forests; Biological system modeling; Predictive models; Batch normalization; borderline-SMOTE; convolutional neural networks (CNNs); protein-protein interaction sites; sample imbalance

Ask authors/readers for more resources

In this study, a novel model combining convolutional neural networks (CNNs) with Batch Normalization is designed to predict protein-protein interaction sites (PPIs), using an oversampling technique Borderline-SMOTE to address the problem of sample imbalance. The effectiveness of the method is validated by comparing it with existing state-of-the-art schemes, and achieved improved accuracies on three public datasets.
The recognition of protein-protein interaction sites (PPIs) is beneficial for the interpretation of protein functions and the development of new drugs. Traditional biological experiments to identify PPI sites are expensive and inefficient, leading to the generation of various computational methods to predict PPIs. However, the accurate prediction of PPI sites remains a big challenge due to the existence of the sample imbalance issue. In this work, we design a novel model that combines convolutional neural networks (CNNs) with Batch Normalization to predict PPI sites, and employ an oversampling technique Borderline-SMOTE to address the sample imbalance issue. In particular, to better characterize the amino acid residues on the protein chains, we employ a sliding window approach for feature extraction of target residues and their contextual residues. We verify the effectiveness of our method by comparing our method with the existing state-of-the-art schemes. The performance validations of our method on three public datasets achieve accuracies of 88.6%, 89.9%, and 86.7%, respectively, all showing improved accuracies compared with the existing schemes. Moreover, the ablation experiment results suggest that Batch Normalization can greatly improve the generalization and the prediction stability of our model.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available