4.7 Article

UniBind: maps of high-confidence direct TF-DNA interactions across nine species

期刊

BMC GENOMICS
卷 22, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s12864-021-07760-6

关键词

Transcription factor binding sites; ChIP-seq; TF-DNA interactions; Transcription regulation; Evolutionary conservation; Cis-regulatory modules; UniBind

资金

  1. Norwegian Research Council [187615, 288404]
  2. Helse Sor-Ost
  3. University of Oslo through the Centre for Molecular Medicine Norway (NCMM)
  4. Norwegian Cancer Society [197884]

向作者/读者索取更多资源

This study processed around 10,000 ChIP-seq datasets to predict approximately 56 million TFBSs, supporting direct TF-DNA interactions for 644 TFs in multiple cell lines and tissues, and predicting over 197,000 cis-regulatory modules. The high-quality of TFBSs was reinforced by their evolutionary conservation, enrichment at active cis-regulatory regions, and ability to predict combinatorial binding of TFs.
Background: Transcription factors (TFs) bind specifically to TF binding sites (TFBSs) at cis-regulatory regions to control transcription. It is critical to locate these TF-DNA interactions to understand transcriptional regulation. Efforts to predict bona fide TFBSs benefit from the availability of experimental data mapping DNA binding regions of TFs (chromatin immunoprecipitation followed by sequencing - ChIP-seq). Results: In this study, we processed similar to 10,000 public ChIP-seq datasets from nine species to provide high-quality TFBS predictions. After quality control, it culminated with the prediction of similar to 56 million TFBSs with experimental and computational support for direct TF-DNA interactions for 644 TFs in > 1000 cell lines and tissues. These TFBSs were used to predict > 197,000 cis-regulatory modules representing clusters of binding events in the corresponding genomes. The high-quality of the TFBSs was reinforced by their evolutionary conservation, enrichment at active cis-regulatory regions, and capacity to predict combinatorial binding of TFs. Further, we confirmed that the cell type and tissue specificity of enhancer activity was correlated with the number of TFs with binding sites predicted in these regions. All the data is provided to the community through the UniBind database that can be accessed through its web-interface (https://unibind.uio.no/), a dedicated RESTful API, and as genomic tracks. Finally, we provide an enrichment tool, available as a web-service and an R package, for users to find TFs with enriched TFBSs in a set of provided genomic regions. Conclusions: UniBind is the first resource of its kind, providing the largest collection of high-confidence direct TF-DNA interactions in nine species.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据