期刊
PROTEIN ENGINEERING
卷 13, 期 3, 页码 149-152出版社
OXFORD UNIV PRESS
DOI: 10.1093/protein/13.3.149
关键词
minimal alphabet; protein fold recognition; sequence alignment
资金
- NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES [R01GM030580] Funding Source: NIH RePORTER
- NIGMS NIH HHS [GM30580] Funding Source: Medline
Protein design experiments have shown that the use of specific subsets of amino acids can produce foldable proteins. This prompts the question of whether there is a minimal amino acid alphabet which could be used to fold all proteins. In this work we make an analogy between sequence patterns which produce foldable sequences and those which make it possible to detect structural homologs by aligning sequences, and use it to suggest the possible size of such a reduced alphabet. We estimate that reduced alphabets containing 10-12 letters can be used to design foldable sequences for a large number of protein families. This estimate is based on the observation that there is little loss of the information necessary to pick out structural homologs in a clustered protein sequence database when a suitable reduction of the amino acid alphabet from 20 to 10 letters is made, but that this information is rapidly degraded when further reductions in the alphabet are made.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据