4.5 Article

Hole or Grain? A Section Pursuit Index for Finding Hidden Structure in Multiple Dimensions

期刊

出版社

TAYLOR & FRANCIS INC
DOI: 10.1080/10618600.2022.2035230

关键词

Data science; Data visualization; Dimension reduction; Exploratory data analysis; Multivariate data; Projection pursuit; Statistical graphics

资金

  1. Australian Research Council

向作者/读者索取更多资源

Multivariate data is often visualized using linear projections, but projections can obscure low and high density regions near the center. This article introduces a section pursuit method to reveal these regions through optimized slices and dynamic structure search. The method is useful for non-uniform or non-normal data distributions, and shows two applications.
Multivariate data is often visualized using linear projections, produced by techniques such as principal component analysis, linear discriminant analysis, and projection pursuit. A problem with projections is that they obscure low and high density regions near the center of the distribution. Sections, or slices, can help to reveal them. This article develops a section pursuit method, building on the extensive work in projection pursuit, to search for interesting slices of the data. Linear projections are used to define sections of the parameter space, and to calculate interestingness by comparing the distribution of observations, inside and outside a section. By optimizing this index, it is possible to reveal features such as holes (low density) or grains (high density). The optimization is incorporated into a guided tour so that the search for structure can be dynamic. The approach can be useful for problems when data distributions depart from uniform or normal, as in visually exploring nonlinear manifolds, and functions in multivariate space. Two applications of section pursuit are shown: exploring decision boundaries from classification models, and exploring subspaces induced by complex inequality conditions from a multiple parameter model. The new methods are available in R, in the tourr package. for this article are available online.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据