☆ 4.7 Review

Comprehensive assessment of machine learning-based methods for predicting antimicrobial peptides

BRIEFINGS IN BIOINFORMATICS (2021)

期刊

BRIEFINGS IN BIOINFORMATICS

卷 22, 期 5, 页码 -

出版社

OXFORD UNIV PRESS

DOI: 10.1093/bib/bbab083

关键词

antimicrobial peptides; bioinformatics; machine learning; deep learning; feature engineering; predictors

类别

Biochemical Research Methods Mathematical & Computational Biology

资金

National Health and Medical Research Council of Australia (NHMRC) [1144652, 1127948]
Young Scientists Fund of the National Natural Science Foundation of China [31701142]
Australian Research Council (ARC) [LP110200333, DP120104460]
Major Inter-Disciplinary Research (IDR) project - Monash University
Collaborative Research Programof Institute for Chemical Research, Kyoto University [2019-32, 2018-28]
National Natural Science Foundation of China [62072243, 61772273]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Antimicrobial peptides (AMPs) are a diverse group of molecules playing crucial roles in biological processes and cellular functions. With the emerging global concern of antimicrobial resistance, research on AMPs has gained popularity. Various computational methods have been developed for accurate prediction of AMPs, with differences in data sets and algorithms affecting predictive performance.

Antimicrobial peptides (AMPs) are a unique and diverse group of molecules that play a crucial role in a myriad of biological processes and cellular functions. AMP-related studies have become increasingly popular in recent years due to antimicrobial resistance, which is becoming an emerging global concern. Systematic experimental identification of AMPs faces many difficulties due to the limitations of current methods. Given its significance, more than 30 computational methods have been developed for accurate prediction of AMPs. These approaches show high diversity in their data set size, data quality, core algorithms, feature extraction, feature selection techniques and evaluation strategies. Here, we provide a comprehensive survey on a variety of current approaches for AMP identification and point at the differences between these methods. In addition, we evaluate the predictive performance of the surveyed tools based on an independent test data set containing 1536 AMPs and 1536 non-AMPs. Furthermore, we construct six validation data sets based on six different common AMP databases and compare different computational methods based on these data sets. The results indicate that amPEPpy achieves the best predictive performance and outperforms the other compared methods. As the predictive performances are affected by the different data sets used by different methods, we additionally perform the 5-fold cross-validation test to benchmark different traditional machine learning methods on the same data set. These cross-validation results indicate that random forest, support vector machine and eXtreme Gradient Boosting achieve comparatively better performances than other machine learning methods and are often the algorithms of choice of multiple AMP prediction tools.

Comprehensive assessment of machine learning-based methods for predicting antimicrobial peptides

期刊

BRIEFINGS IN BIOINFORMATICS

出版社

OXFORD UNIV PRESS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Comprehensive assessment of machine learning-based methods for predicting antimicrobial peptides

期刊

BRIEFINGS IN BIOINFORMATICS

出版社

OXFORD UNIV PRESS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文