☆ 4.5 Article

Sequence Alignment as Hypothesis Testing

JOURNAL OF COMPUTATIONAL BIOLOGY (2011)

期刊

JOURNAL OF COMPUTATIONAL BIOLOGY

卷 18, 期 5, 页码 677-691

出版社

MARY ANN LIEBERT, INC

DOI: 10.1089/cmb.2010.0328

关键词

hypothesis testing; local alignment; power; scoring function; sequence alignment

类别

Biochemical Research Methods Biotechnology & Applied Microbiology Computer Science, Interdisciplinary Applications Mathematical & Computational Biology Statistics & Probability

资金

NSFC [30675012, 60721003, 60928007, 60805010]
NIH [P50HG002790, R21AG032743]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Sequence alignment depends on the scoring function that defines similarity between pairs of letters. For local alignment, the computational algorithm searches for the most similar segments in the sequences according to the scoring function. The choice of this scoring function is important for correctly detecting segments of interest. We formulate sequence alignment as a hypothesis testing problem, and conduct extensive simulation experiments to study the relationship between the scoring function and the distribution of aligned pairs within the aligned segment under this framework. We cut through the many ways to construct scoring functions and showed that any scoring function with negative expectation used in local alignment corresponds to a hypothesis test between the background distribution of sequence letters and a statistical distribution of letter pairs determined by the scoring function. The results indicate that the log-likelihood ratio scoring function is statistically most powerful and has the highest accuracy for detecting the segments of interest that are defined by the statistical distribution of aligned letter pairs.

Sequence Alignment as Hypothesis Testing

期刊

JOURNAL OF COMPUTATIONAL BIOLOGY

出版社

MARY ANN LIEBERT, INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Sequence Alignment as Hypothesis Testing

期刊

JOURNAL OF COMPUTATIONAL BIOLOGY

出版社

MARY ANN LIEBERT, INC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文