☆ 4.2 Article

Interactive graphics for visually diagnosing forest classifiers in R

COMPUTATIONAL STATISTICS (2023)

期刊

COMPUTATIONAL STATISTICS

卷 -, 期 -, 页码 -

出版社

SPRINGER HEIDELBERG

DOI: 10.1007/s00180-023-01323-x

关键词

Statistical visualization; Interactive visualization; Interpretable machine learning; Ensemble model

类别

Statistics & Probability

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This article explores the use of interactive graphics to analyze forest classification models. By combining and analyzing the results from multiple decision trees, a forest classifier can provide insights into class structure in high dimensions. The article discusses various aspects of the models, including model complexity, variable importance, and prediction uncertainty. The methods described can be applied to random forest and other bagged ensemble models, helping to improve their interpretability. The graphics are built using the ggplot2, plotly, and shiny packages in R.

This article describes structuring data and constructing plots to explore forest classification models interactively. A forest classifier is an example of an ensemble since it is produced by bagging multiple trees. The process of bagging and combining results from multiple trees produces numerous diagnostics which, with interactive graphics, can provide a lot of insight into class structure in high dimensions. Various aspects of models are explored in this article, to assess model complexity, individual model contributions, variable importance and dimension reduction, and uncertainty in prediction associated with individual observations. The ideas are applied to the random forest algorithm and projection pursuit forest but could be more broadly applied to other bagged ensembles helping in the interpretability deficit of these methods. Interactive graphics are built in R using the ggplot2, plotly, and shiny packages.

Interactive graphics for visually diagnosing forest classifiers in R

期刊

COMPUTATIONAL STATISTICS

出版社

SPRINGER HEIDELBERG

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Interactive graphics for visually diagnosing forest classifiers in R

期刊

COMPUTATIONAL STATISTICS

出版社

SPRINGER HEIDELBERG

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文