4.7 Article

PyroHMMvar: a sensitive and accurate method to call short indels and SNPs for Ion Torrent and 454 data

期刊

BIOINFORMATICS
卷 29, 期 22, 页码 2859-2868

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btt512

关键词

-

资金

  1. National Basic Research Program of China [2012CB316504]
  2. National High Technology Research and Development Program of China [2012AA020401]
  3. National Natural Science Foundation of China [61175002, 60805010]
  4. Tsinghua University Initiative Scientific Research Program, Center of Excellence of Genome Sciences: Genomic Analysis of the Genotype-Phenotype Map [NIH/HG 2 P50 HG002790-06]

向作者/读者索取更多资源

Motivation: The identification of short insertions and deletions (indels) and single nucleotide polymorphisms (SNPs) from Ion Torrent and 454 reads is a challenging problem, essentially because these techniques are prone to sequence erroneously at homopolymers and can, therefore, raise indels in reads. Most of the existing mapping programs do not model homopolymer errors when aligning reads against the reference. The resulting alignments will then contain various kinds of mismatches and indels that confound the accurate determination of variant loci and alleles. Results: To address these challenges, we realign reads against the reference using our previously proposed hidden Markov model that models homopolymer errors and then merges these pairwise alignments into a weighted alignment graph. Based on our weighted alignment graph and hidden Markov model, we develop a method called PyroHMMvar, which can simultaneously detect short indels and SNPs, as demonstrated in human resequencing data. Specifically, by applying our methods to simulated diploid datasets, we demonstrate that PyroHMMvar produces more accurate results than state-of-the-art methods, such as Samtools and GATK, and is less sensitive to mapping parameter settings than the other methods. We also apply PyroHMMvar to analyze one human whole genome resequencing dataset, and the results confirm that PyroHMMvar predicts SNPs and indels accurately.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据