4.3 Article

Transcriptome-wide association studies accounting for colocalization using Egger regression

期刊

GENETIC EPIDEMIOLOGY
卷 42, 期 5, 页码 418-433

出版社

WILEY
DOI: 10.1002/gepi.22131

关键词

gene Expression; genome-wide association study; Mendelian randomization; transciptome-wide association study

资金

  1. U.S. National Institute of Health [R01 HG009120, R01 GM105857]
  2. U.S. National Cancer Institute [R35 CA197449]
  3. National Institutes of Health [U19 CA148065, X01HG007492]
  4. Cancer Research UK [C1287/A10118, C1287/A16563, C1287/A10710]
  5. European Union [HEALTH-F2-2009-223175, H2020 633784, 634935]
  6. FAS Division of Science, Research Computing Group at Harvard University
  7. [P01 CA134294]
  8. [PSR-SIIRI-701]
  9. NATIONAL CANCER INSTITUTE [R35CA197449, U19CA148065, P01CA134294] Funding Source: NIH RePORTER
  10. NATIONAL HUMAN GENOME RESEARCH INSTITUTE [R01HG009120] Funding Source: NIH RePORTER
  11. NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES [R01GM105857] Funding Source: NIH RePORTER

向作者/读者索取更多资源

Integrating genome-wide association (GWAS) and expression quantitative trait locus (eQTL) data into transcriptome-wide association studies (TWAS) based on predicted expression can boost power to detect novel disease loci or pinpoint the susceptibility gene at a known disease locus. However, it is often the case that multiple eQTL genes colocalize at disease loci, making the identification of the true susceptibility gene challenging, due to confounding through linkage disequilibrium (LD). To distinguish between true susceptibility genes (where the genetic effect on phenotype is mediated through expression) and colocalization due to LD, we examine an extension of the Mendelian randomization (MR) egger regression method that allows for LD while only requiring summary association data for both GWAS and eQTL. We derive the standard TWAS approach in the context of MR and show in simulations that the standard TWAS does not control type I error for causal gene identification when eQTLs have pleiotropic or LD-confounded effects on disease. In contrast, LD-aware MR-Egger (LDA MR-Egger) regression can control type I error in this case while attaining similar power as other methods in situations where these provide valid tests. However, when the direct effects of genetic variants on traits are correlated with the eQTL associations, all of the methods we examined including LDA MR-Egger regression can have inflated type I error. We illustrate these methods by integrating gene expression within a recent large-scale breast cancer GWAS to provide guidance on susceptibility gene identification.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据