Article

Variable Importance Assessment in Regression: Linear Regression versus Random Forest

Journal

AMERICAN STATISTICIAN
Volume 63, Issue 4, Pages 308-319

Publisher

TAYLOR & FRANCIS INC
DOI: 10.1198/tast.2009.08199

Keywords

Linear model; Random forest; Variable importance


Relative importance of regressor variables is an old topic that still awaits a satisfactory solution. When interest is in attributing importance in linear regression, averaging-over-orderings methods for decomposing R² are among the state-of-the-art methods, although the mechanism behind their behavior is not (yet) completely understood. Random forests, a machine-learning tool for classification and regression proposed a few years ago, have an inherent procedure for producing variable importances. This article compares the two approaches (the linear model on the one hand and two versions of random forests on the other) and finds both striking similarities and differences, some of which can be explained whereas others remain a challenge. The investigation improves understanding of the nature of variable importance in random forests. This article has supplementary material online.
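
To make the "averaging over orderings" idea in the abstract concrete, the following is a minimal sketch, not the article's own code. It assumes the usual reading of the technique: each regressor is credited with the increase in R² it brings when it enters the model, and that increase is averaged over all possible entry orders. The function names (`r_squared`, `averaged_over_orderings`) and the simulated data are hypothetical, chosen only for illustration, and the brute-force loop over permutations is practical only for a handful of regressors.

```python
import itertools

import numpy as np


def r_squared(X, y, cols):
    """R^2 of an OLS fit of y on an intercept plus the columns listed in `cols`."""
    if not cols:
        return 0.0  # intercept-only model explains no variance
    design = np.column_stack([np.ones(len(y)), X[:, list(cols)]])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    residual = y - design @ coef
    centered = y - y.mean()
    return 1.0 - (residual @ residual) / (centered @ centered)


def averaged_over_orderings(X, y):
    """Average each regressor's sequential R^2 gain over all entry orders."""
    p = X.shape[1]
    shares = np.zeros(p)
    orderings = list(itertools.permutations(range(p)))
    for order in orderings:
        entered = []
        previous = 0.0
        for j in order:
            entered.append(j)
            current = r_squared(X, y, entered)
            shares[j] += current - previous  # gain credited to regressor j
            previous = current
    return shares / len(orderings)


# Simulated data (an assumption for illustration, not taken from the article):
# x1 and x2 are correlated, x3 has no effect on y.
rng = np.random.default_rng(0)
n = 500
x1 = rng.normal(size=n)
x2 = 0.7 * x1 + rng.normal(size=n)
x3 = rng.normal(size=n)
X = np.column_stack([x1, x2, x3])
y = 2.0 * x1 + 1.0 * x2 + rng.normal(size=n)

shares = averaged_over_orderings(X, y)
full_r2 = r_squared(X, y, [0, 1, 2])
print("importance shares:", shares)
print("sum of shares:", shares.sum(), "full-model R^2:", full_r2)
```

By construction the shares add up to the R² of the full model, which is what makes this kind of decomposition interpretable as a split of explained variance among correlated regressors.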
