☆ 4.7 Article

Empirical comparison of tree ensemble variable importance measures

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS (2011)

Journal

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS

Volume 105, Issue 2, Pages 157-170

Publisher

ELSEVIER

DOI: 10.1016/j.chemolab.2010.12.004

Keywords

Decision trees; Ensemble learning; Random forests; Conditional inference forests; Boosted trees; Variable importance; Fault identification

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Tree ensembles are becoming well-established as popular and powerful data modelling techniques. Tree ensemble models are essentially black box models, although their individual members may not be, and with their growing popularity, interest in the interpretation of tree ensemble models has also grown. This study presents variable importance measures associated with random forests, conditional inference forests and boosted trees, and employs a number of simulated data sets to compare these methods. Overall, variable importance indicators based on bagged conditional inference forests appear to strike a good balance between identification of significant variables and avoiding unnecessary flagging of correlated variables. Data preprocessing and interpretation by experts knowledgeable with a specific data set remain vital. (C) 2010 Elsevier B.V. All rights reserved.

Empirical comparison of tree ensemble variable importance measures

Journal

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS

Publisher

ELSEVIER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Empirical comparison of tree ensemble variable importance measures

Journal

CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS

Publisher

ELSEVIER

Keywords

Categories

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper