期刊
NATURE GENETICS
卷 42, 期 4, 页码 355-U118出版社
NATURE PUBLISHING GROUP
DOI: 10.1038/ng.546
关键词
-
资金
- US National Science Foundation (NSF) [DBI-0321467, DBI-0703908, DBI-0820619, DBI-06638566]
- US National Institutes of Health [1R21AR055228-01A1]
- National Heart, Lung, and Blood Institute [U 01 HL72524, U 01 HL54776, 5U01HL072524-06]
- US Department of Agriculture Research Service [53-K06-5-10, 58-1950-9-001]
- USDA-Cooperative State Research, Education and Extension Service National Research Initiative [2006-35300-17155]
- Morris Animal Foundation [D04CA-135]
- WALTHAM Centre for Pet Nutrition
- Cornell Advanced Technology in Biotechnology
- Cornell Veterinary College
- NATIONAL HEART, LUNG, AND BLOOD INSTITUTE [R01HL054776, U01HL072524] Funding Source: NIH RePORTER
- NATIONAL INSTITUTE OF ARTHRITIS AND MUSCULOSKELETAL AND SKIN DISEASES [R21AR055228] Funding Source: NIH RePORTER
Mixed linear model (MLM) methods have proven useful in controlling for population structure and relatedness within genome-wide association studies. However, MLM-based methods can be computationally challenging for large datasets. We report a compression approach, called 'compressed MLM', that decreases the effective sample size of such datasets by clustering individuals into groups. We also present a complementary approach, 'population parameters previously determined' (P3D), that eliminates the need to re-compute variance components. We applied these two methods both independently and combined in selected genetic association datasets from human, dog and maize. The joint implementation of these two methods markedly reduced computing time and either maintained or improved statistical power. We used simulations to demonstrate the usefulness in controlling for substructure in genetic association datasets for a range of species and genetic architectures. We have made these methods available within an implementation of the software program TASSEL.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据