4.0 Article

Symbolic modeling of epistasis

期刊

HUMAN HEREDITY
卷 63, 期 2, 页码 120-133

出版社

KARGER
DOI: 10.1159/000099184

关键词

data mining; gene-gene interaction; genetic programming; function mapping; symbolic discriminant analysis

资金

  1. EUNICE KENNEDY SHRIVER NATIONAL INSTITUTE OF CHILD HEALTH &HUMAN DEVELOPMENT [R01HD047447] Funding Source: NIH RePORTER
  2. NATIONAL CENTER FOR RESEARCH RESOURCES [P20RR018787] Funding Source: NIH RePORTER
  3. NATIONAL INSTITUTE OF ALLERGY AND INFECTIOUS DISEASES [R01AI059694] Funding Source: NIH RePORTER
  4. NATIONAL LIBRARY OF MEDICINE [R01LM009012] Funding Source: NIH RePORTER
  5. NCRR NIH HHS [RR018787] Funding Source: Medline
  6. NIAID NIH HHS [AI59694] Funding Source: Medline
  7. NICHD NIH HHS [HD047447] Funding Source: Medline
  8. NLM NIH HHS [LM009012] Funding Source: Medline

向作者/读者索取更多资源

The workhorse of modern genetic analysis is the parametric linear model. The advantages of the linear modeling framework are many and include a mathematical understanding of the model fitting process and ease of interpretation. However, an important limitation is that linear models make assumptions about the nature of the data being modeled. This assumption may not be realistic for complex biological systems such as disease susceptibility where nonlinearities in the genotype to phenotype mapping relationship that result from epistasis, plastic reaction norms, locus heterogeneity, and phenocopy, for example, are the norm rather than the exception. We have previously developed a flexible modeling approach called symbolic discriminant analysis (SDA) that makes no assumptions about the patterns in the data. Rather, SDA lets the data dictate the size, shape, and complexity of a symbolic discriminant function that could include any set of mathematical functions from a list of candidates supplied by the user. Here, we outline a new five step process for symbolic model discovery that uses genetic programming (GP) for coarse-grained stochastic searching, experimental design for parameter optimization, graphical modeling for generating expert knowledge, and estimation of distribution algorithms for fine-grained stochastic searching. Finally, we introduce function mapping as a new method for interpreting symbolic discriminant functions. We show that function mapping when combined with measures of interaction information facilitates statistical interpretation by providing a graphical approach to decomposing complex models to highlight synergistic, redundant, and independent effects of polymorphisms and their composite functions. We illustrate this five step SDA modeling process with a real case-control dataset. Copyright (c) 2007 S. Karger AG, Basel.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.0
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据