4.7 Article

CLIP: accurate prediction of disordered linear interacting peptides from protein sequences using co-evolutionary information

期刊

BRIEFINGS IN BIOINFORMATICS
卷 -, 期 -, 页码 -

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbac502

关键词

intrinsic disorder; protein-protein interactions; protein-nucleic acids interactions; linear interacting peptides; protein function; molecular recognition features

资金

  1. National Natural Science Foundation of China [61873185, T2222012]
  2. Robert J. Mattauch Endowment
  3. National Science Foundation [2146027]
  4. Div Of Biological Infrastructure
  5. Direct For Biological Sciences [2146027] Funding Source: National Science Foundation

向作者/读者索取更多资源

This paper introduces a new method for predicting intrinsically disordered regions (IDRs) called CLIP. CLIP uses inputs such as co-evolutionary information, physicochemical profiles, and disorder predictions to predict linear interacting peptides (LIPs) in protein sequences. Experimental results show that CLIP achieves good performance in predicting LIPs and outperforms current tools for predicting MoRFs and disordered protein-binding regions.
One of key features of intrinsically disordered regions (IDRs) is facilitation of protein-protein and protein-nucleic acids interactions. These disordered binding regions include molecular recognition features (MoRFs), short linear motifs (SLiMs) and longer binding domains. Vast majority of current predictors of disordered binding regions target MoRFs, with a handful of methods that predict SLiMs and disordered protein-binding domains. A new and broader class of disordered binding regions, linear interacting peptides (LIPs), was introduced recently and applied in the MobiDB resource. LIPs are segments in protein sequences that undergo disorder to-order transition upon binding to a protein or a nucleic acid, and they cover MoRFs, SLiMs and disordered protein-binding domains. Although current predictors of MoRFs and disordered protein-binding regions could be used to identify some LIPs, there are no dedicated sequence-based predictors of LIPs. To this end, we introduce CLIP, a new predictor of LIPs that utilizes robust logistic regression model to combine three complementary types of inputs: co-evolutionary information derived from multiple sequence alignments, physicochemical profiles and disorder predictions. Ablation analysis suggests that the co-evolutionary information is particularly useful for this prediction and that combining the three inputs provides substantial improvements when compared to using these inputs individually. Comparative empirical assessments using low-similarity test datasets reveal that CLIP secures area under receiver operating characteristic curve (AUC) of 0.8 and substantially improves over the results produced by the closest current tools that predict MoRFs and disordered protein-binding regions. The webserver of CLIP is freely available at http://biomine.cs.vcu.edu/servers/CLIP/ and the standalone code can be downloaded from http://yanglab.qd.sdu.edu.cn/download/CLIP/.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据