4.6 Article

MLCM: Multi-Label Confusion Matrix

Journal

IEEE ACCESS
Volume 10, Issue -, Pages 19083-19095

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2022.3151048

Keywords

Measurement; Classification algorithms; Motion pictures; Loss measurement; Libraries; Task analysis; TV; Classification performance; confusion matrix; machine learning; multi-class; multi-label

Ask authors/readers for more resources

In multi-label classification tasks, traditional confusion matrix is not applicable for performance assessment. Current assessment techniques calculate performance averages for evaluation, but suffer from ambiguity. To address this, we propose a method of creating multi-label confusion matrix based on problem categories and demonstrate its effectiveness through experiments.
Concise and unambiguous assessment of a machine learning algorithm is key to classifier design and performance improvement. In the multi-class classification task, where each instance can only be labeled as one class, the confusion matrix is a powerful tool for performance assessment by quantifying the classification overlap. However, in the multi-label classification task, where each instance can be labeled with more than one class, the confusion matrix is undefined. Performance assessment of the multi-label classifier is currently based on calculating performance averages, such as hamming loss, precision, recall, and F-score. While the current assessment techniques present a reasonable representation of each class and overall performance, their aggregate nature results in ambiguity when identifying false negative (FN) and false positive (FP) results. To address this gap, we define a method of creating the multi-label confusion matrix (MLCM) based on three proposed categories of multi-label problems. After establishing the shortcomings of current methods for identifying FN and FP, we demonstrate the usage of the MLCM with the classification of two publicly available multi-label data sets: i) a 12-lead ECG data set with nine classes, and ii) a movie poster data set with eighteen classes. A comparison of the MLCM results against statistics from the current techniques is presented to show the effectiveness in providing a concise and unambiguous understanding of a multi-label classifier behavior.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available