Article

Entropy-Based Information Gain Approaches to Detect and to Characterize Gene-Gene and Gene-Environment Interactions/Correlations of Complex Diseases

Journal

GENETIC EPIDEMIOLOGY
Volume 35, Issue 7, Pages 706-721

Publisher

WILEY
DOI: 10.1002/gepi.20621

Keywords

gene-gene and gene-environment interactions; entropy; mutual information; interaction information; total correlation information

Funding

  1. National Cancer Institute [R01-CA133996]
  2. NIH [LM009012]
  3. Intergovernmental Personnel Act (IPA)
  4. National Natural Science Foundation [10901135]
  5. Natural Science Foundation of Yunnan Province, P. R. China [2008CD081, 2010CC003]

Abstract

For complex diseases, the relationship between genotypes, environmental factors, and phenotype is usually complex and nonlinear. Our understanding of the genetic architecture of diseases has increased considerably in recent years. However, both conceptually and methodologically, detecting gene-gene and gene-environment interactions remains a challenge, despite the existence of a number of efficient methods. One method that offers great promise but has not yet been widely applied to genomic data is the entropy-based approach of information theory. In this article, we first develop entropy-based test statistics to identify two-way and higher order gene-gene and gene-environment interactions. We then apply these methods to a bladder cancer data set and thereby test their power and identify strengths and weaknesses. For two-way interactions, we propose an information gain (IG) approach based on mutual information. For three-way and higher order interactions, an interaction IG approach is used. In both cases, we develop one-dimensional test statistics to analyze sparse data. Compared to the naive chi-square test, the test statistics we develop have similar or higher power and are robust. Applying them to the bladder cancer data set allowed us to investigate the complex interactions between DNA repair gene single nucleotide polymorphisms, smoking status, and bladder cancer susceptibility. Although not yet widely applied, entropy-based approaches appear to be a useful tool for detecting gene-gene and gene-environment interactions. The test statistics we develop add to a growing body of methodologies that will gradually shed light on the complex architecture of common diseases. Genet. Epidemiol. 35:706-721, 2011. © 2011 Wiley Periodicals, Inc.
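
To make the idea of a mutual-information-based information gain concrete, the sketch below computes the textbook quantity IG(A;B;C) = I(A,B;C) - I(A;C) - I(B;C) with plug-in (frequency-count) entropy estimates. It is only an illustration under stated assumptions: the abstract does not give the authors' exact definitions or their one-dimensional test statistics, so this is the standard interaction-information form and not necessarily the paper's method. The function names, the variable names (snp1, snp2, status), and the eight-subject toy data are all hypothetical.

```python
from collections import Counter
from math import log2


def entropy(*columns):
    """Plug-in Shannon entropy (bits) of the joint distribution of the
    given equally long discrete columns."""
    joint = list(zip(*columns))
    n = len(joint)
    return -sum((c / n) * log2(c / n) for c in Counter(joint).values())


def mutual_information(x, y):
    """I(X;Y) = H(X) + H(Y) - H(X,Y)."""
    return entropy(x) + entropy(y) - entropy(x, y)


def information_gain(a, b, c):
    """IG(A;B;C) = I(A,B;C) - I(A;C) - I(B;C).

    Positive values suggest that A and B jointly carry information about C
    beyond what each carries alone (synergy); values near zero suggest
    no interaction; negative values suggest redundancy.
    """
    joint_ab = list(zip(a, b))  # treat the pair (A, B) as a single variable
    return (mutual_information(joint_ab, c)
            - mutual_information(a, c)
            - mutual_information(b, c))


# Synthetic toy data (not from the paper): two binary-coded markers.
snp1 = [0, 0, 1, 1, 0, 0, 1, 1]
snp2 = [0, 1, 0, 1, 0, 1, 0, 1]

# Hypothetical phenotype driven purely by an XOR-like interaction.
status_interaction = [s1 ^ s2 for s1, s2 in zip(snp1, snp2)]
# Hypothetical phenotype driven by snp1 alone (no interaction).
status_main_effect = list(snp1)

ig_interaction = information_gain(snp1, snp2, status_interaction)
ig_main_effect = information_gain(snp1, snp2, status_main_effect)
print(ig_interaction, ig_main_effect)
```

In the XOR-like case neither marker alone is informative about status, yet the pair determines it completely, so the gain is one full bit; in the main-effect case the gain is zero. Turning such a point estimate into a significance test on real, sparse genotype tables requires a null distribution (for example by permutation), which this sketch does not attempt.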


