4.6 Article

Do little interactions get lost in dark random forests?

Journal

BMC BIOINFORMATICS
Volume 17, Issue -, Pages -

Publisher

BMC
DOI: 10.1186/s12859-016-0995-8

Keywords

Random forests; Trees; Variable importance; Gene-gene interactions; Epistasis

Funding

  1. German Federal Ministry of Education and Research (BMBF) [01ZX1313A-2014]
  2. German Centre for Cardiovascular Research (DZHK
  3. Deutsches Zentrum fur Herz-Kreislauf-Forschung)
  4. European Union FP7 project BiomarCaRE [HEALTH-F2-2011-278913]

Ask authors/readers for more resources

Background: Random forests have often been claimed to uncover interaction effects. However, if and how interaction effects can be differentiated from marginal effects remains unclear. In extensive simulation studies, we investigate whether random forest variable importance measures capture or detect gene-gene interactions. With capturing interactions, we define the ability to identify a variable that acts through an interaction with another one, while detection is the ability to identify an interaction effect as such. Results: Of the single importance measures, the Gini importance captured interaction effects in most of the simulated scenarios, however, they were masked by marginal effects in other variables. With the permutation importance, the proportion of captured interactions was lower in all cases. Pairwise importance measures performed about equal, with a slight advantage for the joint variable importance method. However, the overall fraction of detected interactions was low. In almost all scenarios the detection fraction in a model with only marginal effects was larger than in a model with an interaction effect only. Conclusions: Random forests are generally capable of capturing gene-gene interactions, but current variable importance measures are unable to detect them as interactions. In most of the cases, interactions are masked by marginal effects and interactions cannot be differentiated from marginal effects. Consequently, caution is warranted when claiming that random forests uncover interactions.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available