Article

Motif discovery and motif finding from genome-mapped DNase footprint data

Journal

BIOINFORMATICS
Volume 25, Issue 18, Pages 2318-2325

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btp434

Funding

  1. Russian Fund of Basic Research projects [07-04-01623, 07-04-01584]
  2. INTAS Project [05-1000008-8028]
  3. Russian Federation Agency in Science and Innovation State Contract [02.531.11.9003]
  4. Russian Academy of Sciences Program in Molecular and Cellular Biology [10]
  5. French INRIA Équipe associée MIGEC

Motivation: Footprint data is an important source of information on transcription factor recognition motifs. However, a footprinting fragment may contain no sequence similar to known protein recognition sites. Inspecting the genome sequence adjacent to the fragment can help to identify the missing site positions.

Results: Genome fragments containing footprints were supplied to a pipeline that constructed a position weight matrix (PWM) for each of several motif lengths and selected the optimal PWM. Fragments were aligned with the SeSiMCMC sampler and a new heuristic algorithm, Bigfoot. Footprints with missing hits were found for ~50% of factors. Adding only 2 bp on both sides of a footprinting fragment recovered most hits. We automatically constructed motifs for 41 Drosophila factors. The new motifs recognize footprints with greater sensitivity than existing models at the same false positive rate. We also discuss possible overfitting of the constructed motifs.
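The flank-extension idea in the abstract can be illustrated with a minimal Python sketch. It is not the SeSiMCMC sampler or the Bigfoot algorithm, and it does not reproduce the paper's pipeline: it assumes a simple log-odds PWM built from already aligned sites with a pseudocount and a uniform background, scans one strand only, and expects uppercase A/C/G/T input. The function names (`build_pwm`, `best_hit`, `extended_fragment`), the toy sites, and the toy genome are all hypothetical.

```python
import math

BASES = "ACGT"

def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Build a log-odds PWM (one dict per column) from equal-length aligned sites."""
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all aligned sites must have the same length")
    pwm = []
    for i in range(length):
        # Start every base at the pseudocount so unseen bases keep a finite score.
        counts = {b: pseudocount for b in BASES}
        for s in sites:
            counts[s[i]] += 1
        total = sum(counts.values())
        pwm.append({b: math.log2(counts[b] / total / background) for b in BASES})
    return pwm

def best_hit(pwm, seq):
    """Return (score, offset) of the best window in seq, or None if seq is too short."""
    width = len(pwm)
    best = None
    for off in range(len(seq) - width + 1):
        score = sum(col[seq[off + i]] for i, col in enumerate(pwm))
        if best is None or score > best[0]:
            best = (score, off)
    return best

def extended_fragment(genome, start, end, flank=2):
    """Return genome[start:end] widened by `flank` bp per side, plus its new start."""
    lo = max(0, start - flank)
    hi = min(len(genome), end + flank)
    return genome[lo:hi], lo

# Toy data (assumed, not from the paper): the site TTGACA sits at positions 4..9,
# but the annotated footprint [6, 12) cuts off its first 2 bp.
sites = ["TTGACA", "TTGACA", "TTGACT", "TAGACA"]
pwm = build_pwm(sites)
genome = "CCCC" + "TTGACA" + "CCCC"

narrow_score, _ = best_hit(pwm, genome[6:12])
fragment, frag_start = extended_fragment(genome, 6, 12, flank=2)
wide_score, wide_off = best_hit(pwm, fragment)
print(f"footprint only: {narrow_score:.2f}")
print(f"with 2 bp flanks: {wide_score:.2f} at genome position {frag_start + wide_off}")
```

In this toy case the full site becomes visible only after the footprint is widened by 2 bp on each side, so the best score rises and the hit is placed at the true site start. A real analysis would also scan the reverse complement and choose a score threshold for a target false positive rate.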
