4.2 Article

HLA haplotype frequency estimation for heterogeneous populations using a graph-based imputation algorithm

期刊

HUMAN IMMUNOLOGY
卷 82, 期 10, 页码 746-757

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.humimm.2021.07.001

关键词

Multi-region expectation-maximization; algorithm; HLA; Haplotype frequencies

向作者/读者索取更多资源

A novel multi-region EM implementation that integrates HLA information from different population groups has been developed, showing higher likelihood values and better haplotype recovery than other EM implementations when tested on real and simulated datasets. This approach addresses the need for improved HLA imputation and matching for multi-region populations.
HLA haplotype frequencies are estimated from ambiguous unphased HLA genotyping data using Expectation-Maximization (EM) algorithms. Current population genetics methods require independent EM frequency estimates for each population, and assume that each population is in Hardy-Weinberg Equilibrium (HWE). The HWE assumption of EM has thus far resulted in the exclusion of individuals from mixed or unknown ethnic backgrounds from reference datasets. Multi-region populations are currently poorly served by stem cell donor registry HLA imputation and matching implementations due to the inability of such algorithms to incorporate admixture into their population genetics models. To address this unmet need, we have expanded the imputation component of our GRaph IMputation and Matching (GRIMM) framework, where imputation becomes the expectation step in an iterative EM algorithm. Our novel multi-region EM implementation considers region as a Bayesian prior, enabling integration of HLA information from multiple single-region population groups, and for the first time including individuals with ambiguous or mixed ethnic backgrounds. We show that our multi-region EM produces much higher likelihood values and better haplotype recovery as measured by Kullback-Leibler divergence than all evaluated EM implementations when tested on real datasets of US donor registry HLA typings as well as simulated multi-region datasets of ambiguous HLA typings. (c) 2021 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据