4.5 Article

Promoter features related to tissue specificity as measured by Shannon entropy

期刊

GENOME BIOLOGY
卷 6, 期 4, 页码 -

出版社

BMC
DOI: 10.1186/gb-2005-6-4-r33

关键词

-

资金

  1. NATIONAL HUMAN GENOME RESEARCH INSTITUTE [R01HG001539] Funding Source: NIH RePORTER
  2. NATIONAL INSTITUTE OF DIABETES AND DIGESTIVE AND KIDNEY DISEASES [R01DK063336] Funding Source: NIH RePORTER
  3. NHGRI NIH HHS [R01 HG001539, R01HG001539] Funding Source: Medline
  4. NIDDK NIH HHS [R01 DK063336, 1R01DK63336] Funding Source: Medline

向作者/读者索取更多资源

Background: The regulatory mechanisms underlying tissue specificity are a crucial part of the development and maintenance of multicellular organisms. A genome-wide analysis of promoters in the context of gene-expression patterns in tissue surveys provides a means of identifying the general principles for these mechanisms. Results: We introduce a definition of tissue specificity based on Shannon entropy to rank human genes according to their overall tissue specificity and by their specificity to particular tissues. We apply our definition to microarray-based and expressed sequence tag (EST)-based expression data for human genes and use similar data for mouse genes to validate our results. We show that most genes show statistically significant tissue-dependent variations in expression level. We find that the most tissue-specific genes typically have a TATA box, no CpG island, and often code for extracellular proteins. As expected, CpG islands are found in most of the least tissue-specific genes, which often code for proteins located in the nucleus or mitochondrion. The class of genes with no CpG island or TATA box are the most common mid-specificity genes and commonly code for proteins located in a membrane. Sp1 was found to be a weak indicator of less-specific expression. YY1 binding sites, either as initiators or as downstream sites, were strongly associated with the least-specific genes. Conclusions: We have begun to understand the components of promoters that distinguish tissue-specific from ubiquitous genes, to identify associations that can predict the broad class of gene expression from sequence data alone.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据