期刊
JOURNAL OF COMPUTATIONAL BIOLOGY
卷 30, 期 2, 页码 189-203出版社
MARY ANN LIEBERT, INC
DOI: 10.1089/cmb.2022.0067
关键词
genome-wide association studies; phylogenetics; genetic similarity
Genome-wide association studies are often affected by population stratification. Linear mixed models are a powerful method for controlling confounding factors. However, uncertainty about the phylogenetic structure may impact the quality of the results. This study proposes a linear mixed model that incorporates a genetic similarity matrix derived from Markov chain Monte Carlo estimates of the phylogeny.
Genome-wide association studies (GWASs) are often confounded by population stratification and structure. Linear mixed models (LMMs) are a powerful class of methods for uncovering genetic effects, while controlling for such confounding. LMMs include random effects for a genetic similarity matrix, and they assume that a true genetic similarity matrix is known. However, uncertainty about the phylogenetic structure of a study population may degrade the quality of LMM results. This may happen in bacterial studies in which the number of samples or loci is small, or in studies with low-quality genotyping. In this study, we develop methods for linear mixed models in which the genetic similarity matrix is unknown and is derived from Markov chain Monte Carlo estimates of the phylogeny. We apply our model to a GWAS of multidrug resistance in tuberculosis, and illustrate our methods on simulated data.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据