Journal
NATURE COMMUNICATIONS
Volume 12, Issue 1, Pages -Publisher
NATURE RESEARCH
DOI: 10.1038/s41467-021-23655-2
Keywords
-
Categories
Funding
- National Heart, Lung, and Blood Institute (NHLBI) [R01HL120393-03S1, 1R35HL135818, 1R21HL145425]
Ask authors/readers for more resources
Pooling participant-level genetic data into a single analysis can result in variance stratification, reducing statistical performance. The authors developed variant-specific inflation factors to assess variance stratification and apply this to pooled individual-level data from whole genome sequencing.
In modern Whole Genome Sequencing (WGS) epidemiological studies, participant-level data from multiple studies are often pooled and results are obtained from a single analysis. We consider the impact of differential phenotype variances by study, which we term 'variance stratification'. Unaccounted for, variance stratification can lead to both decreased statistical power, and increased false positives rates, depending on how allele frequencies, sample sizes, and phenotypic variances vary across the studies that are pooled. We develop a procedure to compute variant-specific inflation factors, and show how it can be used for diagnosis of genetic association analyses on pooled individual level data from multiple studies. We describe a WGS-appropriate analysis approach, implemented in freely-available software, which allows study-specific variances and thereby improves performance in practice. We illustrate the variance stratification problem, its solutions, and the proposed diagnostic procedure, in simulations and in data from the Trans-Omics for Precision Medicine Whole Genome Sequencing Program (TOPMed), used in association tests for hemoglobin concentrations and BMI. Pooling participant-level genetic data into a single analysis can result in variance stratification, reducing statistical performance. Here, the authors develop variant-specific inflation factors to assess variance stratification and apply this to pooled individual-level data from whole genome sequencing.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available