☆ 4.5 Article

Improving reverse vaccinology with a machine learning approach

VACCINE (2011)

Journal

VACCINE

Volume 29, Issue 45, Pages 8156-8164

Publisher

ELSEVIER SCI LTD

DOI: 10.1016/j.vaccine.2011.07.142

Keywords

Reverse vaccinology; Bacterial pathogens; Protective antigen; Support vector machines

Funding

Genomics Core at the UCSD Center for AIDS Research [AI36214]
Biomedical Informatics Research Center at San Diego State University
San Diego Veterans Medical Research Foundation

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Reverse vaccinology aims to accelerate subunit vaccine design by rapidly predicting which proteins in a pathogenic bacterial proteome are putative protective antigens. Support vector machine classification is a machine learning approach that has been applied to solve numerous classification problems in biological sciences but has not previously been incorporated into a reverse vaccinology approach. A training data set of 136 bacterial protective antigens paired with 136 non-antigens was constructed and bioinformatic tools were used to annotate this data for predicted protein features, many of which are associated with antigenicity (i.e. extracellular localization, signal peptides and B-cell epitopes). Annotation was used to train support vector machine classifiers that exhibited a maximum accuracy of 92% for discriminating protective antigens from non-antigens as assessed by a leave-tenth-out cross-validation approach. These accuracies were superior to those achieved when annotating training data with auto and cross covariance transformations of z-descriptors for hydrophobicity, molecular size and polarity, or when classification was performed using regression methods. To further validate support vector machine classifiers, they were used to rank all the proteins in six bacterial proteomes for their antigenicity. Protective antigens from the training data were significantly recalled (enriched) in the top 75 ranked proteins for all six proteomes as assessed by a Fisher's exact test (p < 0.05). This paper describes a superior workflow for performing reverse vaccinology studies and provides a benchmark training data set that can be used to evaluate future methodological improvements. Published by Elsevier Ltd.

Improving reverse vaccinology with a machine learning approach

Journal

VACCINE

Publisher

ELSEVIER SCI LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Improving reverse vaccinology with a machine learning approach

Journal

VACCINE

Publisher

ELSEVIER SCI LTD

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper