4.2 Article

Interactive graphics for visually diagnosing forest classifiers in R

期刊

COMPUTATIONAL STATISTICS
卷 -, 期 -, 页码 -

出版社

SPRINGER HEIDELBERG
DOI: 10.1007/s00180-023-01323-x

关键词

Statistical visualization; Interactive visualization; Interpretable machine learning; Ensemble model

向作者/读者索取更多资源

This article explores the use of interactive graphics to analyze forest classification models. By combining and analyzing the results from multiple decision trees, a forest classifier can provide insights into class structure in high dimensions. The article discusses various aspects of the models, including model complexity, variable importance, and prediction uncertainty. The methods described can be applied to random forest and other bagged ensemble models, helping to improve their interpretability. The graphics are built using the ggplot2, plotly, and shiny packages in R.
This article describes structuring data and constructing plots to explore forest classification models interactively. A forest classifier is an example of an ensemble since it is produced by bagging multiple trees. The process of bagging and combining results from multiple trees produces numerous diagnostics which, with interactive graphics, can provide a lot of insight into class structure in high dimensions. Various aspects of models are explored in this article, to assess model complexity, individual model contributions, variable importance and dimension reduction, and uncertainty in prediction associated with individual observations. The ideas are applied to the random forest algorithm and projection pursuit forest but could be more broadly applied to other bagged ensembles helping in the interpretability deficit of these methods. Interactive graphics are built in R using the ggplot2, plotly, and shiny packages.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据