期刊
ISCIENCE
卷 24, 期 9, 页码 -出版社
CELL PRESS
DOI: 10.1016/j.isci.2021.103017
关键词
-
资金
- NSF [MCB1925646, MCB-1925643]
- Butler University Innovation
Machine learning analysis revealed several functionality rules of transcriptional activation domains (tADs), including distribution patterns of specific amino acid residues. These rules are not absolute, but contribute to understanding the mechanism of gene expression activation.
The mechanisms by which transcriptional activation domains ( tADs) initiate eukaryotic gene expression have been an enigma for decades because most tADs lack specificity in sequence, structure, and interactions with targets. Machine learning analysis of data sets of tAD sequences generated in vivo elucidated several functionality rules: the functional tAD sequences should (i) be devoid of or depleted with basic amino acid residues, (ii) be enriched with aromatic and acidic residues, (iii) be with aromatic residues localized mostly near the terminus of the sequence, and acidic residues localized more internally within a span of 20-30 amino acids, (iv) be with both aromatic and acidic residues preferably spread out in the sequence and not clustered, and (v) not be separated by occasional basic residues. These and other more subtle rules are not absolute, reflecting absence of a tAD consensus sequence, enormous variability, and consistent with surfactant-like tAD biochemical properties. The findings are compatible with the paradigm-shifting nucleosome detergent mechanismof gene expression activation, contributing to the development of the liquid-liquid phase separation model and the biochemistry of near-stochastic functional allosteric interactions.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据