☆ 4.5 Article

Random forest for ordinal responses: Prediction and variable selection

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2016)

期刊

COMPUTATIONAL STATISTICS & DATA ANALYSIS

卷 96, 期 -, 页码 57-73

出版社

ELSEVIER

DOI: 10.1016/j.csda.2015.10.005

关键词

Random forest; Ordinal regression trees; Ordinal response; Prediction; Feature selection; Variable importance

类别

Computer Science, Interdisciplinary Applications Statistics & Probability

资金

German Science Foundation [BO3139/6-1, BO3139/2-2]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

The random forest method is a commonly used tool for classification with high dimensional data that is able to rank candidate predictors through its inbuilt variable importance measures. It can be applied to various kinds of regression problems including nominal, metric and survival response variables. While classification and regression problems using random forest methodology have been extensively investigated in the past, in the case of ordinal response there is no standard procedure. Extensive studies using random forest based on conditional inference trees are conducted to explore whether incorporating the ordering information yields any improvement in both prediction performance or variable selection. Two novel permutation variable importance measures are presented that are reasonable alternatives to the currently implemented importance measure which was developed for nominal response and makes no use of the ordering in the levels of an ordinal response variable. Results based on simulated and real data suggest that predictor rankings can be improved in some settings by using new permutation importance measures that explicitly use the ordering in the response levels in combination with ordinal regression trees. With respect to prediction accuracy, the performance of ordinal regression trees was similar to and in most settings even slightly better than that of classification trees. (C) 2015 Elsevier B.V. All rights reserved.

Random forest for ordinal responses: Prediction and variable selection

期刊

COMPUTATIONAL STATISTICS & DATA ANALYSIS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Random forest for ordinal responses: Prediction and variable selection

期刊

COMPUTATIONAL STATISTICS & DATA ANALYSIS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文