3.8 Article

A Super Scalable Algorithm for Short Segment Detection

期刊

STATISTICS IN BIOSCIENCES
卷 13, 期 1, 页码 18-33

出版社

SPRINGER
DOI: 10.1007/s12561-020-09278-z

关键词

Copy number variation; Inference; Nonparametric method; Signal detection

资金

  1. NSF [DMS-1722691, CCF-1740858, DMS-1722562, DMS-1722544]
  2. Simons Foundation [524432]
  3. University of Arizona
  4. NIH [R01DA016750, 1R01HG010171, R01MH116527]

向作者/读者索取更多资源

This paper presents a super scalable short segment detection algorithm that can effectively identify short segments hidden in long sequences and assign significance levels to the detected segments. The algorithm is computationally efficient, does not rely on Gaussian noise assumption, and demonstrates advantages through theoretical, simulation, and real data studies.
In many applications such as copy number variant (CNV) detection, the goal is to identify short segments on which the observations have different means or medians from the background. Those segments are usually short and hidden in a long sequence and hence are very challenging to find. We study a super scalable short segment (4S) detection algorithm in this paper. This nonparametric method clusters the locations where the observations exceed a threshold for segment detection. It is computationally efficient and does not rely on Gaussian noise assumption. Moreover, we develop a framework to assign significance levels for detected segments. We demonstrate the advantages of our proposed method by theoretical, simulation, and real data studies.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据