☆ 4.7 Article

Labelling strategies for hierarchical multi-label classification techniques

PATTERN RECOGNITION (2016)

期刊

PATTERN RECOGNITION

卷 56, 期 -, 页码 170-183

出版社

ELSEVIER SCI LTD

DOI: 10.1016/j.patcog.2016.02.017

关键词

Hierarchical multi-label classification; Threshold optimisation; Hierarchical loss; HMC-loss; F-measure

类别

Computer Science, Artificial Intelligence Engineering, Electrical & Electronic

资金

Ghent University

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Many hierarchical multi-label classification systems predict a real valued score for every (instance, class) couple, with a higher score reflecting more confidence that the instance belongs to that class. These classifiers leave the conversion of these scores to an actual label set to the user, who applies a cut-off value to the scores. The predictive performance of these classifiers is usually evaluated using threshold independent measures like precision-recall curves. However, several applications require actual label sets, and thus an automatic labelling strategy. In this paper, we present and evaluate different alternatives to perform the actual labelling in hierarchical multi-label classification. We investigate the selection of both single and multiple thresholds. Despite the existence of multiple threshold selection strategies in non-hierarchical multi-label classification, they cannot be applied directly to the hierarchical context. The proposed strategies are implemented within two main approaches: optimisation of a certain performance measure of interest (such as F-measure or hierarchical loss), and simulating training set properties (such as class distribution or label cardinality) in the predictions. We assess the performance of the proposed labelling schemes on 10 datasets from different application domains. Our results show that selecting multiple thresholds may result in an efficient and effective solution for hierarchical multi-label problems. (C) 2016 Elsevier Ltd. All rights reserved.

Labelling strategies for hierarchical multi-label classification techniques

期刊

PATTERN RECOGNITION

出版社

ELSEVIER SCI LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Labelling strategies for hierarchical multi-label classification techniques

期刊

PATTERN RECOGNITION

出版社

ELSEVIER SCI LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文