期刊
BMC GENOMICS
卷 8, 期 -, 页码 -出版社
BMC
DOI: 10.1186/1471-2164-8-161
关键词
-
资金
- Intramural NIH HHS Funding Source: Medline
- NCI NIH HHS [N01-CO-12400, N01CO12400] Funding Source: Medline
Background: Protein domains are fundamental evolutionary units of protein architecture, composing proteins in a modular manner. Combinations of two or more, possibly non- adjacent, domains are thought to play specific functional roles within proteins. Indeed, while the number of potential co- occurring domain sets ( CDSs) is very large, only a few of these occur in nature. Here we study the principles governing domain content of proteins, using yeast as a model species. Results: We design a novel representation of proteins and their constituent domains as a protein-domain network. An analysis of this network reveals 99 CDSs that occur in proteins more than expected by chance. The identified CDSs are shown to preferentially include ancient domains that are conserved from bacteria or archaea. Moreover, the protein sets spanned by these combinations were found to be highly functionally coherent, significantly match known protein complexes, and enriched with protein- protein interactions. These observations serve to validate the biological significance of the identified CDSs. Conclusion: Our work provides a comprehensive list of co- occurring domain sets in yeast, and sheds light on their function and evolution.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据