4.6 Article

PVP-SVM: Sequence-Based Prediction of Phage Virion Proteins Using a Support Vector Machine

Journal

FRONTIERS IN MICROBIOLOGY
Volume 9, Issue -, Pages -

Publisher

FRONTIERS MEDIA SA
DOI: 10.3389/fmicb.2018.00476

Keywords

bacteriophage virion proteins; feature selection; hybrid features; machine learning; support vector machine

Categories

Funding

  1. Basic Science Research Program through the National Research Foundation (NRF) of Korea - Ministry of Education, Science, and Technology [2015R1D1A1A09060192, 2009-0093826]
  2. Brain Research Program through the National Research Foundation of Korea (NRF) - Ministry of Science, ICT & Future Planning [2016M3C7A1904392]
  3. National Research Foundation of Korea [2015R1D1A1A09060192, 2009-0093826] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Ask authors/readers for more resources

Accurately identifying bacteriophage virion proteins from uncharacterized sequences is important to understand interactions between the phage and its host bacteria in order to develop new antibacterial drugs. However, identification of such proteins using experimental techniques is expensive and often time consuming: hence, development of an efficient computational algorithm for the prediction of phage virion proteins (PVPs) prior to in vitro experimentation is needed. Here, we describe a support vector machine (SVM)-based PVP predictor, called PVP-SVM, which was trained with 136 optimal features. A feature selection protocol was employed to identify the optimal features from a large set that included amino acid composition, dipeptide composition, atomic composition, physicochemical properties, and chain-transition-distribution. PVP-SVM achieved an accuracy of 0.870 during leave-one-out cross-validation, which was 6% higher than control SVM predictors trained with all features, indicating the efficiency of the feature selection method. Furthermore, PVP-SVM displayed superior performance compared to the currently available method, PVPred, and two other machine-learning methods developed in this study when objectively evaluated with an independent dataset. For the convenience of the scientific community, a user-friendly and publicly accessible web server has been established at www.thegleelab.org/PVP-SVM/PVP-SVM.html.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available