4.7 Article

Explaining the performance of multilabel classification methods with data set properties

Journal

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS
Volume 37, Issue 9, Pages 6080-6122

Publisher

WILEY-HINDAWI
DOI: 10.1002/int.22835

Keywords

hyperparameter tuning; machine learning; meta knowledge; meta learning; multilabel classification

Funding

  1. Javna agencija za raziskovalno dejavnost rs [J2-9230, P50093, V5-1930, P2-0103]
  2. European Commission [952215]

Ask authors/readers for more resources

In this study, a comprehensive meta-learning analysis of data sets and methods for multilabel classification was conducted. The results showed that meta features describing the label space were the most important, and meta features describing label relationships occurred more frequently than those describing label distributions. Furthermore, optimizing hyperparameters can improve predictive performance, although the extent of improvement may not always be justified by resource utilization.
Meta learning generalizes the empirical experience with different learning tasks and holds promise for providing important empirical insight into the behavior of machine learning algorithms. In this paper, we present a comprehensive meta-learning study of data sets and methods for multilabel classification (MLC). MLC is a practically relevant machine learning task where each example is labeled with multiple labels simultaneously. Here, we analyze 40 MLC data sets by using 50 meta features describing different properties of the data. The main findings of this study are as follows. First, the most prominent meta features that describe the space of MLC data sets are the ones assessing different aspects of the label space. Second, the meta models show that the most important meta features describe the label space, and, the meta features describing the relationships among the labels tend to occur a bit more often than the meta features describing the distributions between and within the individual labels. Third, the optimization of the hyperparameters can improve the predictive performance, however, quite often the extent of the improvements does not always justify the resource utilization.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available