4.6 Article

The HDAC9-associated risk locus promotes coronary artery disease by governing TWIST1

Journal

PLOS GENETICS
Volume 18, Issue 6, Pages -

Publisher

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pgen.1010261

Keywords

-

Funding

  1. NIH [R01HL130423, R01HL135093, R01HL148167, RG194194]
  2. Fondation Leducq
  3. Swedish Research Council [2018-02529]
  4. Heart Lung Foundation [20170265]
  5. Foundation Leducq [18CVD02, 12CVD02]
  6. Astra-Zeneca
  7. Bourne Foundation
  8. Agilent
  9. American Heart Association [20POST35120545]
  10. Swedish Research Council [2018-02529] Funding Source: Swedish Research Council
  11. Vinnova [2018-02529] Funding Source: Vinnova

Ask authors/readers for more resources

Genome wide association studies (GWAS) can identify single nucleotide polymorphisms (SNPs) associated with common disorders, but often do not provide causal mechanisms or tissue-specific effects. This study developed a data-driven informatics pipeline to gain insights on complex genetic loci, and applied it to understand a cluster of SNPs associated with coronary artery disease (CAD) near the HDAC9 gene. The pipeline demonstrated that this locus is causal for CAD by modulating TWIST1 expression levels in the arterial wall, and by governing a gene regulatory co-expression network related to metabolic function in skeletal muscle.
Genome wide association studies (GWAS) have identified thousands of single nucleotide polymorphisms (SNPs) associated with the risk of common disorders. However, since the large majority of these risk SNPs reside outside gene-coding regions, GWAS generally provide no information about causal mechanisms regarding the specific gene(s) that are affected or the tissue(s) in which these candidate gene(s) exert their effect. The 'gold standard' method for understanding causal genes and their mechanisms of action are laborious basic science studies often involving sophisticated knockin or knockout mouse lines, however, these types of studies are impractical as a high-throughput means to understand the many risk variants that cause complex diseases like coronary artery disease (CAD). As a solution, we developed a streamlined, data-driven informatics pipeline to gain mechanistic insights on complex genetic loci. The pipeline begins by understanding the SNPs in a given locus in terms of their relative location and linkage disequilibrium relationships, and then identifies nearby expression quantitative trait loci (eQTLs) to determine their relative independence and the likely tissues that mediate their disease-causal effects. The pipeline then seeks to understand associations with other disease-relevant genes, disease sub-phenotypes, potential causality (Mendelian randomization), and the regulatory and functional involvement of these genes in gene regulatory co-expression networks (GRNs). Here, we applied this pipeline to understand a cluster of SNPs associated with CAD within and immediately adjacent to the gene encoding HDAC9. Our pipeline demonstrated, and validated, that this locus is causal for CAD by modulation of TWIST1 expression levels in the arterial wall, and by also governing a GRN related to metabolic function in skeletal muscle. Our results reconciled numerous prior studies, and also provided clear evidence that this locus does not govern HDAC9 expression, structure or function. This pipeline should be considered as a powerful and efficient way to understand GWAS risk loci in a manner that better reflects the highly complex nature of genetic risk associated with common disorders. Author summaryGenome wide association studies (GWAS) have identified thousands of single nucleotide polymorphisms (SNPs) associated with the risk of common disorders. However, for the great majority of these SNPs, the causal mechanisms regarding the specific gene(s) that are affected or the tissue(s) in which these candidate gene(s) exert their effect are unknown. As a solution, we developed a streamlined, data-driven informatics pipeline to gain mechanistic insights on complex genetic loci. Here, we applied this pipeline to understand a cluster of SNPs associated with coronary artery disease (CAD) within and immediately adjacent to the gene encoding HDAC9. Our pipeline demonstrated, and validated, that this locus is causal for CAD by modulation of TWIST1 gene expression levels in the arterial wall, and by also governing a gene regulatory co-expression network related to metabolic function in skeletal muscle. Our results reconciled numerous prior studies, and also demonstrated that this locus does not govern HDAC9 expression, structure or function. This pipeline should be considered as a powerful and efficient way to understand GWAS risk loci.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available