Article

Classification of histogram-valued data with support histogram machines

Journal

JOURNAL OF APPLIED STATISTICS
Volume 50, Issue 3, Pages 675-690

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/02664763.2021.1947996

Keywords

Support vector machines; symbolic data; Wasserstein-Kantorovich metric

This paper addresses classification problems in which the predictors are histograms, either observed directly or obtained by aggregating data. Conventional classification methods convert histograms into vector-valued data using summary values, which neglects the distributional information in the histograms. To address this issue, the authors propose a margin-based classifier, the support histogram machine (SHM), built on the support vector machine framework and the Wasserstein-Kantorovich metric. The experimental results show that SHM outperforms summary-value-based methods.
Today's large volumes of data and advanced technologies have produced new types of complex data, such as histogram-valued data. The paper focuses on classification problems in which predictors are observed as, or aggregated into, histograms. Because conventional classification methods take vectors as input, a natural approach converts histograms into vector-valued data using summary values, such as the mean or median. However, this approach forgoes the distributional information available in histograms. To address this issue, we propose a margin-based classifier called the support histogram machine (SHM) for histogram-valued data. We adopt the support vector machine framework and use the Wasserstein-Kantorovich metric to measure distances between histograms. The proposed optimization problem is solved by a dual approach. We then test the proposed SHM on simulated and real examples and demonstrate its superior performance over summary-value-based methods.
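
The point about lost distributional information can be illustrated with a small sketch. It is not the paper's SHM implementation. It assumes, for simplicity, that both histograms share one sorted grid of bin centers and are treated as discrete distributions on those centers, and it uses the order-1 Wasserstein distance, which may differ from the variant used in the paper. The function names `hist_mean` and `wasserstein1` are hypothetical.

```python
from itertools import accumulate


def hist_mean(centers, probs):
    """Mean of a histogram treated as a discrete distribution on bin centers."""
    return sum(c * p for c, p in zip(centers, probs))


def wasserstein1(centers, p, q):
    """Order-1 Wasserstein distance between two discrete distributions.

    Assumption: p and q are probability vectors on the same sorted grid.
    In one dimension the distance is the area between the two CDFs.
    """
    cdf_p = list(accumulate(p))
    cdf_q = list(accumulate(q))
    total = 0.0
    for i in range(len(centers) - 1):
        gap = centers[i + 1] - centers[i]
        total += abs(cdf_p[i] - cdf_q[i]) * gap
    return total


# Two histograms with the same mean but very different shapes.
centers = [0.0, 1.0, 2.0, 3.0, 4.0]
narrow = [0.0, 0.0, 1.0, 0.0, 0.0]  # all mass in the middle bin
wide = [0.5, 0.0, 0.0, 0.0, 0.5]    # mass split between the extremes

mean_gap = abs(hist_mean(centers, narrow) - hist_mean(centers, wide))
w1 = wasserstein1(centers, narrow, wide)

print(mean_gap)  # the mean-based summary cannot tell the two apart
print(w1)        # the Wasserstein distance separates them
```

Here the mean-based summary gives a gap of zero, while the Wasserstein distance is positive. A distance-based margin classifier can use this kind of difference, whereas a classifier fed only the means cannot.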