4.3 Article

Regression and data mining methods for analyses of multiple rare variants in the Genetic Analysis Workshop 17 mini-exome data

期刊

GENETIC EPIDEMIOLOGY
卷 35, 期 -, 页码 S92-S100

出版社

WILEY
DOI: 10.1002/gepi.20657

关键词

rare variants; LASSO; machine learning; random forests; logic regression; binary trees; Poisson regression; ISIS; classification trees; meta-analysis; extreme sampling

资金

  1. National Institutes of Health (NIH) [R01 GM031575]
  2. National Human Genome Research Institute, NIH [T32 MH-14235, R21 DA033827]
  3. NATIONAL HUMAN GENOME RESEARCH INSTITUTE [ZIAHG000153] Funding Source: NIH RePORTER
  4. NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES [R01GM031575] Funding Source: NIH RePORTER
  5. NATIONAL INSTITUTE OF MENTAL HEALTH [T32MH014235] Funding Source: NIH RePORTER
  6. NATIONAL INSTITUTE ON DRUG ABUSE [R21DA033827] Funding Source: NIH RePORTER

向作者/读者索取更多资源

Group 14 of Genetic Analysis Workshop 17 examined several issues related to analysis of complex traits using DNA sequence data. These issues included novel methods for analyzing rare genetic variants in an aggregated manner (often termed collapsing rare variants), evaluation of various study designs to increase power to detect effects of rare variants, and the use of machine learning approaches to model highly complex heterogeneous traits. Various published and novel methods for analyzing traits with extreme locus and allelic heterogeneity were applied to the simulated quantitative and disease phenotypes. Overall, we conclude that power is (as expected) dependent on locus-specific heritability or contribution to disease risk, large samples will be required to detect rare causal variants with small effect sizes, extreme phenotype sampling designs may increase power for smaller laboratory costs, methods that allow joint analysis of multiple variants per gene or pathway are more powerful in general than analyses of individual rare variants, population-specific analyses can be optimal when different subpopulations harbor private causal mutations, and machine learning methods may be useful for selecting subsets of predictors for follow-up in the presence of extreme locus heterogeneity and large numbers of potential predictors. Genet. Epidemiol. 35:S92S100, 2011. (C) 2011 Wiley Periodicals, Inc.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据