4.3 Article

hCKSAAP_UbSite: Improved prediction of human ubiquitination sites by exploiting amino acid pattern and properties

Journal

BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS
Volume 1834, Issue 8, Pages 1461-1467

Publisher

ELSEVIER
DOI: 10.1016/j.bbapap.2013.04.006

Keywords

Ubiquitination site; Post-translational modification; Composition of k-spaced amino acid pairs; Amino acid physicochemical properties

Funding

  1. National Key Basic Research Program of China [2009CB918802]
  2. National Natural Science Foundation of China [31070259, 61202167]
  3. National Health and Medical Research Council of Australia (NHMRC)
  4. Hundred Talents Program of the Chinese Academy of Sciences (CAS)
  5. Knowledge Innovative Program of CAS [KSCX2-EW-G-8]
  6. Tianjin Municipal Science & Technology Commission [10ZCKFSY05600]

Ask authors/readers for more resources

As one of the most common post-translational modifications, ubiquitination regulates the quantity and function of a variety of proteins. Experimental and clinical investigations have also suggested the crucial roles of ubiquitination in several human diseases. The complicated sequence context of human ubiquitination sites revealed by proteomic studies highlights the need of developing effective computational strategies to predict human ubiquitination sites. Here we report the establishment of a novel human-specific ubiquitination site predictor through the integration of multiple complementary classifiers. Firstly, a Support Vector Machine (SVM) classier was constructed based on the composition of k-spaced amino acid pairs (CKSAAP) encoding, which has been utilized in our previous yeast ubiquitination site predictor. To further exploit the pattern and properties of the ubiquitination sites and their flanking residues, three additional SVM classifiers were constructed using the binary amino acid encoding, the AAindex physicochemical property encoding and the protein aggregation propensity encoding, respectively. Through an integration that relied on logistic regression, the resulting predictor termed hCKSAAP_UbSite achieved an area under ROC curve (AUC) of 0.770 in 5-fold cross-validation test on a class-balanced training dataset. When tested on a class-balanced independent testing dataset that contains 3419 ubiquitination sites, hCKSAAP_UbSite has also achieved a robust performance with an AUC of 0.757. Specifically, it has consistently performed better than the predictor using the CKSAAP encoding alone and two other publicly available predictors which are not human-specific. Given its promising performance in our large-scale datasets, hCKSAAP_UbSite has been made publicly available at our server (http://protein.cau.edu.cnicksaap_ubsite/). (C) 2013 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available