4.8 Article

Comprehensive discovery of DNA motifs in 349 human cells and tissues reveals new features of motifs

Journal

NUCLEIC ACIDS RESEARCH
Volume 43, Issue 1, Pages 74-83

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/nar/gku1261

Keywords

-

Funding

  1. National Science Foundation [1149955, 1125676, 1218275]
  2. Direct For Biological Sciences
  3. Div Of Biological Infrastructure [1149955] Funding Source: National Science Foundation
  4. Directorate For Engineering
  5. Div Of Chem, Bioeng, Env, & Transp Sys [1125676] Funding Source: National Science Foundation
  6. Div Of Information & Intelligent Systems
  7. Direct For Computer & Info Scie & Enginr [1218275] Funding Source: National Science Foundation

Ask authors/readers for more resources

Comprehensive motif discovery under experimental conditions is critical for the global understanding of gene regulation. To generate a nearly complete list of human DNA motifs under given conditions, we employed a novel approach to de novo discover significant co-occurring DNA motifs in 349 human DNase I hypersensitive site datasets. We predicted 845 to 1325 motifs in each dataset, for a total of 2684 non-redundant motifs. These 2684 motifs contained 54.02 to 75.95% of the known motifs in seven large collections including TRANSFAC. In each dataset, we also discovered 43 663 to 2 013 288 motif modules, groups of motifs with their binding sites co-occurring in a significant number of short DNA regions. Compared with known interacting transcription factors in eight resources, the predicted motif modules on average included 84.23% of known interacting motifs. We further showed new features of the predicted motifs, such as motifs enriched in proximal regions rarely overlapped with motifs enriched in distal regions, motifs enriched in 5' distal regions were often enriched in 3' distal regions, etc. Finally, we observed that the 2684 predicted motifs classified the cell or tissue types of the datasets with an accuracy of 81.29%. The resources generated in this study are available at http://server.cs.ucf.edu/predrem/.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available