4.7 Article

Empirical comparison and analysis of web-based cell-penetrating peptide prediction tools

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 21, Issue 2, Pages 408-420

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bby124

Keywords

cell-penetrating peptides; machine learning algorithm; feature representation; web servers

Funding

  1. National Natural Science Foundation of China [61701340, 61702361, 61771331]
  2. Natural Science Foundation of Tianjin City [18JCQNJC00500, 18JCQNJC00800]
  3. National Key R&D Program of China [SQ2018YFC090002]
  4. Basic Science Research Program through the National Research Foundation of Korea - Ministry of Education, Science, and Technology [2018R1D1A1B07049572]

Ask authors/readers for more resources

Cell-penetrating peptides (CPPs) facilitate the delivery of therapeutically relevant molecules, including DNA, proteins and oligonucleotides, into cells both in vitro and in vivo. This unique ability explores the possibility of CPPs as therapeutic delivery and its potential applications in clinical therapy. Over the last few decades, a number of machine learning (ML)-based prediction tools have been developed, and some of them are freely available as web portals. However, the predictions produced by various tools are difficult to quantify and compare. In particular, there is no systematic comparison of the web-based prediction tools in performance, especially in practical applications. In this work, we provide a comprehensive review on the biological importance of CPPs, CPP database and existing ML-based methods for CPP prediction. To evaluate current prediction tools, we conducted a comparative study and analyzed a total of 12 models from 6 publicly available CPP prediction tools on 2 benchmark validation sets of CPPs and non-CPPs. Our benchmarking results demonstrated that a model from the KELM-CPPpred, namely KELM-hybrid-AAC, showed a significant improvement in overall performance, when compared to the other 11 prediction models. Moreover, through a length-dependency analysis, we find that existing prediction tools tend to more accurately predict CPPs and non-CPPs with the length of 20-25 residues long than peptides in other length ranges.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available