4.5 Article

Identification of hormone-binding proteins using a novel ensemble classifier

Journal

COMPUTING
Volume 101, Issue 6, Pages 693-703

Publisher

SPRINGER WIEN
DOI: 10.1007/s00607-018-0682-x

Keywords

Protein classification; Feature extraction; ANOVA; Ensemble classifier

Funding

  1. National Key R&D Program of China [SQ2018YFC090002]
  2. Natural Science Foundation of China [61771331, 61871282]

Ask authors/readers for more resources

Hormone-binding proteins (HBPs) are important soluble carriers for growth hormones, and correct recognition of HBPs is crucial to understanding their functions. Therefore, we aimed to construct an efficient and reliable classifier to identify HBPs accurately. At first, 246 proteins were collected from UniProt database and considered as the objective benchmark dataset. We employed the 8000-dimensional feature extraction method based on tripeptide compositions to formulate protein samples. Subsequently, we alleviated the intricate feature set by utilizing ANOVA, a feature ranking technique, and acquired the optimal feature subset devoid of redundant information. Furthermore, we utilized three classification methods to process the selected tripeptide features, which generated three probability sequences. Finally, the three probability sequences were considered as new features, and addressed by the support vector machine to construct a prediction model. Results indicated that 90.6% of accuracy was achieved in five-fold cross validation, which was superior to that of other published methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available