4.7 Article

MOWGLI: prediction of protein-MannOse interacting residues With ensemble classifiers usinG evoLutionary Information

Journal

JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS
Volume 34, Issue 10, Pages 2069-2083

Publisher

TAYLOR & FRANCIS INC
DOI: 10.1080/07391102.2015.1106978

Keywords

position-specific scoring matrix; consensus approach; 10-fold cross-validation; first-line host defense; imbalanced benchmark data set; random forest; support vector machines

Ask authors/readers for more resources

Proteins interact with carbohydrates to perform various cellular interactions. Of the many carbohydrate ligands that proteins bind with, mannose constitute an important class, playing important roles in host defense mechanisms. Accurate identification of mannose-interacting residues (MIR) may provide important clues to decipher the underlying mechanisms of protein-mannose interactions during infections. This study proposes an approach using an ensemble of base classifiers for prediction of MIR using their evolutionary information in the form of position-specific scoring matrix. The base classifiers are random forests trained by different subsets of training data set Dset128 using 10-fold cross-validation. The optimized ensemble of base classifiers, MOWGLI, is then used to predict MIR on protein chains of the test data set Dtestset29 which showed a promising performance with 92.0% accurate prediction. An overall improvement of 26.6% in precision was observed upon comparison with the state-of-art. It is hoped that this approach, yielding enhanced predictions, could be eventually used for applications in drug design and vaccine development.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available