4.5 Article Proceedings Paper

Maximum entropy modeling of short sequence motifs with applications to RNA splicing signals

期刊

JOURNAL OF COMPUTATIONAL BIOLOGY
卷 11, 期 2-3, 页码 377-394

出版社

MARY ANN LIEBERT, INC
DOI: 10.1089/1066527041410418

关键词

maximum entropy; splice sites; nonneighboring dependencies; Markov models; maximal dependence decomposition; molecular sequence analysis; sequence motif

向作者/读者索取更多资源

We propose a framework for modeling sequence motifs based on the maximum entropy principle (MEP). We recommend approximating short sequence motif distributions with the maximum entropy distribution (MED) consistent with low-order marginal constraints estimated from available data, which may include dependencies between nonadjacent as well as adjacent positions. Many maximum entropy models (MEMs) are specified by simply changing the set of constraints. Such models can be utilized to discriminate between signals and decoys. Classification performance using different MEMs gives insight into the relative importance of dependencies between different positions. We apply our framework to large datasets of RNA splicing signals. Our best models out-perform previous probabilistic models in the discrimination of human 5' (donor) and 3' (acceptor) splice sites from decoys. Finally, we discuss mechanistically motivated ways of comparing models.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据