4.7 Article

Multi-label classification with weighted classifier selection and stacked ensemble

Journal

INFORMATION SCIENCES
Volume 557, Issue -, Pages 421-442

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2020.06.017

Keywords

Multi-label classification; Stacked ensemble; Label correlation; Base classifier selection; Regularization via sparsity

Funding

  1. Natural Science Foundation of China [61663046, 61876166]
  2. Program for Excellent Young Talents of Yunnan University
  3. Yunnan provincial major science and technology special plan projects: digitization research and application demonstration of Yunnan characteristic industry [202002AD080001]

Ask authors/readers for more resources

In this study, a novel stacked ensemble approach that simultaneously exploits label correlations and the process of learning weights of ensemble members is proposed. Experimental results demonstrate that the proposed algorithm outperforms related state-of-the-art methods in multi-label classification tasks.
Multi-label classification has attracted increasing attention in various applications, such as medical diagnosis and semantic annotation. With such trend, a large number of ensemble approaches have been proposed for multi-label classification tasks. Most of these approaches construct the ensemble members by using bagging schemes, but few stacked ensemble approaches are developed. Existing research on stacked ensemble approaches remains active, but several issues remain such as (1) little has been done to learn the weights of classifiers for classifier selection; (2) the relationship between pairwise label correlations and multi-label classification performance has not been investigated sufficiently. To address these issues, we propose a novel stacked ensemble approach that simultaneously exploits label correlations and the process of learning weights of ensemble members. In our approach, first, a weighted stacked ensemble with sparsity regularization is developed to facilitate classifier selection and ensemble members construction for multilabel classification. Second, in order to improve the classification performance, the pairwise label correlations are further considered for determining weights of these ensemble members. Finally, we develop an optimization algorithm based on both of the accelerated proximal gradient and the block coordinate descent techniques to achieve the optimal ensemble solution efficiently. Extensive experiments on publicly available datasets and real Cardiovascular and Cerebrovascular Disease datasets demonstrate that our proposed algorithm outperforms related state-of-the-art methods from perspectives of benchmarking and real-world applications. (C) 2020 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available