4.8 Article

Discovering transcriptional regulatory regions in Drosophila by a nonalignment method for phylogenetic footprinting

Publisher

NATL ACAD SCIENCES
DOI: 10.1073/pnas.0701614104

Keywords

enhancer discovery; regulatory sequence evolution; regulatory sequence prediction; transcription regulation; comparative genomics

Funding

  1. NCI NIH HHS [U54 CA121852-02, U54 CA121852] Funding Source: Medline
  2. NIGMS NIH HHS [GM074105, R24 GM074105, GM054510, R01 GM054510] Funding Source: Medline

Ask authors/readers for more resources

The functional annotation of the nonprotein-coding DNA of eukaryotic genomes is a problem of central importance. Phylogenetic footprinting methods, which attempt to identify functional regulatory regions by comparing orthologous genomic sequences of evolutionarily related species, have shown promising results. The main advantage of this class of approaches is that they do not require any knowledge of the regulating transcription factors. Here we describe a method called Enhancer Detection using only Genomic Information (EDGI), which integrates a traditional motif-discovery algorithm with a local permutation-clustering algorithm. Together, they can identify large regulatory elements (e.g., enhancers) as evolutionarily conserved order-independent clusters of short conserved motifs. We show that EDGI can distinguish between established sets of known enhancers and nonenhancers with 88% accuracy, rivaling predictions by methods that rely on the knowledge of the regulating transcription factors and their DNA-binding specificities. We tested EDGI's performance on a set of Drosophila genomes. Our results demonstrate that comparative genomic analysis of multiple closely related species has substantial power to identify key functional elements without additional biological knowledge.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available