4.7 Article

Efficient and effective control of confounding in eQTL mapping studies through joint differential expression and Mendelian randomization analyses

期刊

BIOINFORMATICS
卷 37, 期 3, 页码 296-302

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btaa715

关键词

-

资金

  1. National Institutes of Health (NIH) [R01HG009124, R01GM126553]
  2. National Science Foundation (NSF) [DMS1712933]
  3. China Scholarship Council

向作者/读者索取更多资源

ECCO is presented as a simple and computationally scalable alternative to determine the optimal number of PEER factors for eQTL mapping studies, using joint analysis of differential expression and Mendelian randomization. It shows similar effectiveness to the standard approach in identifying the required number of PEER factors for eQTL mapping, but is two orders of magnitude faster. This computational scalability allows for optimized eQTL discovery across 48 GTEx tissues, resulting in a 5.89% power gain in the number of eGenes discovered compared to previous recommendations.
Motivation: Identifying cis-acting genetic variants associated with gene expression levels-an analysis commonly referred to as expression quantitative trait loci (eQTLs) mapping-is an important first step toward understanding the genetic determinant of gene expression variation. Successful eQTL mapping requires effective control of confounding factors. A common method for confounding effects control in eQTL mapping studies is the probabilistic estimation of expression residual (PEER) analysis. PEER analysis extracts PEER factors to serve as surrogates for confounding factors, which is further included in the subsequent eQTL mapping analysis. However, it is computationally challenging to determine the optimal number of PEER factors used for eQTL mapping. In particular, the standard approach to determine the optimal number of PEER factors examines one number at a time and chooses a number that optimizes eQTLs discovery. Unfortunately, this standard approach involves multiple repetitive eQTL mapping procedures that are computationally expensive, restricting its use in large-scale eQTL mapping studies that being collected today. Results: Here, we present a simple and computationally scalable alternative, Effect size Correlation for COnfounding determination (ECCO), to determine the optimal number of PEER factors used for eQTL mapping studies. Instead of performing repetitive eQTL mapping, ECCO jointly applies differential expression analysis and Mendelian randomization analysis, leading to substantial computational savings. In simulations and real data applications, we show that ECCO identifies a similar number of PEER factors required for eQTL mapping analysis as the standard approach but is two orders of magnitude faster. The computational scalability of ECCO allows for optimized eQTL discovery across 48 GTEx tissues for the first time, yielding an overall 5.89% power gain on the number of eQTL harboring genes (eGenes) discovered as compared to the previous GTEx recommendation that does not attempt to determine tissue-specific optimal number of PEER factors.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据