4.7 Article

Holistic similarity-based prediction of phosphorylation sites for understudied kinases

Journal

BRIEFINGS IN BIOINFORMATICS
Volume 24, Issue 2, Pages -

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbac624

Keywords

understudied kinase; kinase-specific phosphorylation; phosphorylation site prediction; protein-protein interaction; protein similarity

Ask authors/readers for more resources

Phosphorylation is a crucial mechanism for regulating protein activities, but experimental determination of kinase-specific phosphorylation sites is time-consuming and expensive. This study aims to create predictive models for understudied kinases by leveraging relevant sources of similarities. Results demonstrate that 82 out of 116 understudied kinases were predicted with adequate performance, proving the reliability of web-like predictive networks in capturing patterns in such kinases.
Phosphorylation is an essential mechanism for regulating protein activities. Determining kinase-specific phosphorylation sites by experiments involves time-consuming and expensive analyzes. Although several studies proposed computational methods to model kinase-specific phosphorylation sites, they typically required abundant experimentally verified phosphorylation sites to yield reliable predictions. Nevertheless, the number of experimentally verified phosphorylation sites for most kinases is relatively small, and the targeting phosphorylation sites are still unidentified for some kinases. In fact, there is little research related to these understudied kinases in the literature. Thus, this study aims to create predictive models for these understudied kinases. A kinase-kinase similarity network was generated by merging the sequence-, functional-, protein-domain- and 'STRING'-related similarities. Thus, besides sequence data, protein-protein interactions and functional pathways were also considered to aid predictive modelling. This similarity network was then integrated with a classification of kinase groups to yield highly similar kinases to a specific understudied type of kinase. Their experimentally verified phosphorylation sites were leveraged as positive sites to train predictive models. The experimentally verified phosphorylation sites of the understudied kinase were used for validation. Results demonstrate that 82 out of 116 understudied kinases were predicted with adequate performance via the proposed modelling strategy, achieving a balanced accuracy of 0.81, 0.78, 0.84, 0.84, 0.85, 0.82, 0.90, 0.82 and 0.85, for the 'TK', 'Other', 'STE', 'CAMK', 'TKL', 'CMGC', 'AGC', 'CK1' and 'Atypical' groups, respectively. Therefore, this study demonstrates that web-like predictive networks can reliably capture the underlying patterns in such understudied kinases by harnessing relevant sources of similarities to predict their specific phosphorylation sites.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available