4.7 Article

Procleave: Predicting Protease-specific Substrate Cleavage Sites by Combining Sequence and Structural Information

期刊

GENOMICS PROTEOMICS & BIOINFORMATICS
卷 18, 期 1, 页码 52-64

出版社

ELSEVIER
DOI: 10.1016/j.gpb.2019.08.002

关键词

Protease; Cleavage site prediction; Machine learning; Conditional random field; Structural determinants

资金

  1. Australian Research Council (ARC) [LP110200333, DP120104460]
  2. National Health and Medical Research Council of Australia (NHMRC) [APP1127948, APP1144652, APP490989]
  3. National Institute of Allergy and InfectiousDiseases of theNational Institutes of Health, USA [R01 AI111965]
  4. Major Inter-Disciplinary Research (IDR) - Monash University, Australia [2019-32, 2018-28]
  5. School of Medicine, University of Alabama at Birmingham, USA

向作者/读者索取更多资源

Proteases are enzymes that cleave and hydrolyse the peptide bonds between two specific amino acid residues of target substrate proteins. Protease-controlled proteolysis plays a key role in the degradation and recycling of proteins, which is essential for various physiological processes. Thus, solving the substrate identification problem will have important implications for the precise understanding of functions and physiological roles of proteases, as well as for therapeutic target identification and pharmaceutical applicability. Consequently, there is a great demand for bioinfor-matics methods that can predict novel substrate cleavage events with high accuracy by utilizing both sequence and structural information. In this study, we present Procleave, a novel bioinformatics approach for predicting protease-specific substrates and specific cleavage sites by taking into account both their sequence and 3D structural information. Structural features of known cleavage sites were represented by discrete values using a LOWESS data-smoothing optimization method, which turned out to be critical for the performance of Procleave. The optimal approximations of all structural parameter values were encoded in a conditional random field (CRF) computational frame-work, alongside sequence and chemical group-based features. Here, we demonstrate the outstand-ing performance of Procleave through extensive benchmarking and independent tests. Procleave is capable of correctly identifying most cleavage sites in the case study. Importantly, when applied to the human structural proteome encompassing 17,628 protein structures, Procleave suggests a num-ber of potential novel target substrates and their corresponding cleavage sites of different proteases. Procleave is implemented as a webserver and is freely accessible at http://procleave.erc.monash.edu/.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据