☆ 4.6 Article

Effect of Various Sequence Descriptors in Predicting Human Protein-protein Interactions Using ANN-based Prediction Models

CURRENT BIOINFORMATICS (2021)

期刊

CURRENT BIOINFORMATICS

卷 16, 期 8, 页码 1024-1033

出版社

BENTHAM SCIENCE PUBL LTD

DOI: 10.2174/1574893616666210402114623

关键词

Protein-protein interactions; machine learning; protein descriptors; artificial neural network; PPI-predictions; hu-man PPI

类别

Biochemical Research Methods Mathematical & Computational Biology

资金

UGC-SAP of the Department of Biotechnology & Bioinformatics, & Bioinformatics Infrastruc-ture Facility (BIF) at the School of Life Sciences, the University of Hyderabad

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Understanding protein-protein interactions is crucial for investigating the biological role of proteins. This study evaluates the efficacy of various sequence-based descriptors in predicting PPIs, with the results showing that the conjoint-triad descriptors performed the best.

Aims: A number of sequence-based descriptors for proteins have been proposed by many researchers. This study aims to evaluate the performance of these descriptors in predicting protein-protein interactions on the benchmark dataset. Background: The behavior of a protein inside or outside the cell is defined by its interaction with the elements present in the surrounding environment, which include small metabolites to the macromolecules such as RNA, DNA, or proteins. Of these, understanding protein-protein interactions (PPIs) is one of the important aspects to investigate the biological role of a protein. The interactions of a protein are determined by how it folds in 3-dimensional space, and this threedimensional folding of a protein largely depends on the linear sequence of amino acids. This information makes it possible to exploit the sequences for proteins to computationally determine the possible interactions among them. Objective: This study aims at studying the efficacy of various sequence-based descriptors in predicting protein-protein interactions. Methods: In this study, we have used the benchmark dataset of interacting and non-interacting protein pairs provided by Pan et al. to build the PPI prediction models using artificial neural networks. We have compared the efficacy of different descriptors on two types of datasets, one with all the protein pairs and the second with proteins having less than 25% identity. Result: The results show that conjoint-triad descriptors performed better than other descriptors in predicting PPIs. The feature selection on the conjoint triad was performed and the effect on the prediction model with reduced features versus all feature sets was studied. Conclusion: The classification model with conjoint-triad descriptors obtained the highest accuracy. The feature ranking for the conjoint triad descriptor was utilized and the model performance was compared with all and selected features. The model with reduced features shows less overfitting.

Effect of Various Sequence Descriptors in Predicting Human Protein-protein Interactions Using ANN-based Prediction Models

期刊

CURRENT BIOINFORMATICS

出版社

BENTHAM SCIENCE PUBL LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Effect of Various Sequence Descriptors in Predicting Human Protein-protein Interactions Using ANN-based Prediction Models

期刊

CURRENT BIOINFORMATICS

出版社

BENTHAM SCIENCE PUBL LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文