☆ 4.8 Article

NestedMICA: sensitive inference of over-represented motifs in nucleic acid sequence

NUCLEIC ACIDS RESEARCH (2005)

期刊

NUCLEIC ACIDS RESEARCH

卷 33, 期 5, 页码 1445-1453

出版社

OXFORD UNIV PRESS

DOI: 10.1093/nar/gki282

关键词

类别

Biochemistry & Molecular Biology

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

NestedMICA is a new, scalable, pattern-discovery system for finding transcription factor binding sites and similar motifs in biological sequences. Like several previous methods, NestedMICA tackles this problem by optimizing a probabilistic mixture model to fit a set of sequences. However, the use of a newly developed inference strategy called Nested Sampling means NestedMICA is able to find optimal solutions without the need for a problematic initialization or seeding step. We investigate the performance of NestedMICA in a range scenario, on synthetic data and a well-characterized set of muscle regulatory regions, and compare it with the popular MEME program. We show that the new method is significantly more sensitive than MEME: in one case, it successfully extracted a target motif from background sequence four times longer than could be handled by the existing program. It also performs robustly on synthetic sequences containing multiple significant motifs. When tested on a real set of regulatory sequences, NestedMICA produced motifs which were good predictors for all five abundant classes of annotated binding sites.

NestedMICA: sensitive inference of over-represented motifs in nucleic acid sequence

期刊

NUCLEIC ACIDS RESEARCH

出版社

OXFORD UNIV PRESS

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

NestedMICA: sensitive inference of over-represented motifs in nucleic acid sequence

期刊

NUCLEIC ACIDS RESEARCH

出版社

OXFORD UNIV PRESS

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文