4.6 Article

A two-layer integration framework for protein complex detection

期刊

BMC BIOINFORMATICS
卷 17, 期 -, 页码 -

出版社

BMC
DOI: 10.1186/s12859-016-0939-3

关键词

Protein complex; Protein interaction data; Co-complex matrix; Consensus matrix; Matrix fusion; Matrix decomposition

资金

  1. National Science Foundation of China [11171354, 61375033, 61532008, 61402190]
  2. Ministry of Education of China [20120171110016]
  3. Natural Science Foundation of Guangdong Province [S2013020012796]
  4. Self-determined Research Funds of CCNU from the colleges' basic research and operation of MOE [CCNU15A05039, CCNU15ZD011]
  5. City University of Hong Kong [9610034]

向作者/读者索取更多资源

Background: Protein complexes carry out nearly all signaling and functional processes within cells. The study of protein complexes is an effective strategy to analyze cellular functions and biological processes. With the increasing availability of proteomics data, various computational methods have recently been developed to predict protein complexes. However, different computational methods are based on their own assumptions and designed to work on different data sources, and various biological screening methods have their unique experiment conditions, and are often different in scale and noise level. Therefore, a single computational method on a specific data source is generally not able to generate comprehensive and reliable prediction results. Results: In this paper, we develop a novel Two-layer INtegrative Complex Detection (TINCD) model to detect protein complexes, leveraging the information from both clustering results and raw data sources. In particular, we first integrate various clustering results to construct consensus matrices for proteins to measure their overall co-complex propensity. Second, we combine these consensus matrices with the co-complex score matrix derived from Tandem Affinity Purification/Mass Spectrometry (TAP) data and obtain an integrated co-complex similarity network via an unsupervised metric fusion method. Finally, a novel graph regularized doubly stochastic matrix decomposition model is proposed to detect overlapping protein complexes from the integrated similarity network. Conclusions: Extensive experimental results demonstrate that TINCD performs much better than 21 state-of-the-art complex detection techniques, including ensemble clustering and data integration techniques.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据