4.4 Article

GBScleanR: robust genotyping error correction using a hidden Markov model with error pattern recognition

期刊

GENETICS
卷 224, 期 2, 页码 -

出版社

GENETICS SOCIETY AMERICA
DOI: 10.1093/genetics/iyad055

关键词

reduced-representation sequencing; error correction; imputation; hidden Markov model

向作者/读者索取更多资源

This paper introduces an error correction tool called GBScleanR, which incorporates marker-specific error rates into the hidden Markov model (HMM) to achieve robust and precise error correction for noisy RRS-based genotype data. The results indicate that GBScleanR improves accuracy by more than 25 percentage points compared to existing tools in simulation data sets and achieves the most reliable genotype estimation in real data even with error-prone markers.
Reduced-representation sequencing (RRS) provides cost-effective and time-saving genotyping platforms. Despite the outstanding advantage of RRS in throughput, the obtained genotype data usually contain a large number of errors. Several error correction methods employing the hidden Markov model (HMM) have been developed to overcome these issues. These methods assume that markers have a uniform error rate with no bias in the allele read ratio. However, bias does occur because of uneven amplification of genomic fragments and read mismapping. In this paper, we introduce an error correction tool, GBScleanR, which enables robust and precise error correction for noisy RRS-based genotype data by incorporating marker-specific error rates into the HMM. The results indicate that GBScleanR improves the accuracy by more than 25 percentage points at maximum compared to the existing tools in simulation data sets and achieves the most reliable genotype estimation in real data even with error-prone markers.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据