4.7 Article

KnockoffTrio: A knockoff framework for the identification of putative causal variants in genome-wide association studies with trio design

Journal

AMERICAN JOURNAL OF HUMAN GENETICS
Volume 109, Issue 10, Pages 1761-1776

Publisher

CELL PRESS
DOI: 10.1016/j.ajhg.2022.08.013

Keywords

-

Funding

  1. NIH/National Institute of Mental Health [MH106910, MH095797]

Ask authors/readers for more resources

Family-based designs can eliminate confounding effects and distinguish genetic effects, but suffer from low power due to limited sample sizes. KnockoffTrio is a statistical method that identifies potential causal genetic variants in father-mother-child trio designs. It controls false discovery rate and has shown effectiveness in identifying significant associations in autism spectrum disorder data.
Family-based designs can eliminate confounding due to population substructure and can distinguish direct from indirect genetic effects, but these designs are underpowered due to limited sample sizes. Here, we propose KnockoffTrio, a statistical method to identify putative causal genetic variants for father-mother-child trio design built upon a recently developed knockoff framework in statistics. KnockoffTrio controls the false discovery rate (FDR) in the presence of arbitrary correlations among tests and is less conservative and thus more powerful than the conventional methods that control the family-wise error rate via Bonferroni correction. Furthermore, KnockoffTrio is not restricted to family-based association tests and can be used in conjunction with more powerful, potentially nonlinear models to improve the power of standard family-based tests. We show, using empirical simulations, that KnockoffTrio can prioritize causal variants over associations due to linkage disequilibrium and can provide protection against confounding due to population stratification. In applications to 14,200 trios from three study cohorts for autism spectrum disorders (ASDs), including AGP, SPARK, and SSC, we show that KnockoffTrio can identify multiple significant associations that are missed by conventional tests applied to the same data. In particular, we replicate known ASD association signals with variants in several genes such as MACROD2, NRXN1, PRKARIB, CADM2, PCDH9, and DOCK4 and identify additional associations with variants in other genes including ARHGEF10, SLC28A1, ZNF589, and HINT1 at FDR 10%.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available