期刊
GENOME BIOLOGY
卷 16, 期 -, 页码 -出版社
BMC
DOI: 10.1186/s13059-015-0758-2
关键词
-
资金
- National Institute of Health [R01-GM109836, R01-HG007834]
- Computational Cancer Biology Training Program, Cancer Prevention and Research Institute of Texas (CPRIT) [RP140113]
- National Cancer Institute [1R01CA183793, 1R01CA174206, P30 CA016672]
- Cancer Prevention Research Institute of Texas [RP130090]
SomaticSeq is an accurate somatic mutation detection pipeline implementing a stochastic boosting algorithm to produce highly accurate somatic mutation calls for both single nucleotide variants and small insertions and deletions. The workflow currently incorporates five state-of-the-art somatic mutation callers, and extracts over 70 individual genomic and sequencing features for each candidate site. A training set is provided to an adaptively boosted decision tree learner to create a classifier for predicting mutation statuses. We validate our results with both synthetic and real data. We report that SomaticSeq is able to achieve better overall accuracy than any individual tool incorporated.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据