4.8 Article

Scalable generalized linear mixed model for region-based association tests in large biobanks and cohorts

Journal

NATURE GENETICS
Volume 52, Issue 6, Pages 634-+

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41588-020-0621-6

Keywords

-

Funding

  1. Stiftelsen Kristian Gerhard Jebsen
  2. Faculty of Medicine and Health Sciences at the Norwegian University of Science and Technology (NTNU)
  3. Central Norway Regional Health Authority
  4. National Institutes of Health [R01HG008773]
  5. National Human Genome Research Institute of the National Institutes of Health [T32HG010464]

Ask authors/readers for more resources

SAIGE-GENE is a scalable generalized mixed-model region-based association test that can analyze large datasets while accounting for sample relatedness and unbalanced case-control ratios for binary traits. With very large sample sizes, biobanks provide an exciting opportunity to identify genetic components of complex traits. To analyze rare variants, region-based multiple-variant aggregate tests are commonly used to increase power for association tests. However, because of the substantial computational cost, existing region-based tests cannot analyze hundreds of thousands of samples while accounting for confounders such as population stratification and sample relatedness. Here we propose a scalable generalized mixed-model region-based association test, SAIGE-GENE, that is applicable to exome-wide and genome-wide region-based analysis for hundreds of thousands of samples and can account for unbalanced case-control ratios for binary traits. Through extensive simulation studies and analysis of the HUNT study with 69,716 Norwegian samples and the UK Biobank data with 408,910 White British samples, we show that SAIGE-GENE can efficiently analyze large-sample data (N > 400,000) with type I error rates well controlled.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available