4.6 Article

A novel ensemble learning method for de novo computational identification of DNA binding sites

期刊

BMC BIOINFORMATICS
卷 8, 期 -, 页码 -

出版社

BMC
DOI: 10.1186/1471-2105-8-249

关键词

-

资金

  1. NHGRI NIH HHS [T32 HG000035, T32 HG00035] Funding Source: Medline

向作者/读者索取更多资源

Background: Despite the diversity of motif representations and search algorithms, the de novo computational identification of transcription factor binding sites remains constrained by the limited accuracy of existing algorithms and the need for user- specified input parameters that describe the motif being sought. Results: We present a novel ensemble learning method, SCOPE, that is based on the assumption that transcription factor binding sites belong to one of three broad classes of motifs: non-degenerate, degenerate and gapped motifs. SCOPE employs a unified scoring metric to combine the results from three motif finding algorithms each aimed at the discovery of one of these classes of motifs. We found that SCOPE's performance on 78 experimentally characterized regulons from four species was a substantial and statistically significant improvement over that of its component algorithms. SCOPE outperformed a broad range of existing motif discovery algorithms on the same dataset by a statistically significant margin. Conclusion: SCOPE demonstrates that combining multiple, focused motif discovery algorithms can provide a significant gain in performance. By building on components that efficiently search for motifs without user- defined parameters, SCOPE requires as input only a set of upstream sequences and a species designation, making it a practical choice for non- expert users. A user- friendly web interface, Java source code and executables are available at http://genie.dartmouth.edu/scope.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据