☆ 4.5 Article

Beyond AP: a new evaluation index for multiclass classification task accuracy

APPLIED INTELLIGENCE (2021)

Journal

APPLIED INTELLIGENCE

Volume 51, Issue 10, Pages 7166-7176

Publisher

SPRINGER

DOI: 10.1007/s10489-021-02223-7

Keywords

Machine learning; Multiclassification; Evaluation index; R ' method

Funding

National Key Research and Development Program of China [2018YFB0204301]
Open Fund of PDL [6142110190201]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

The study introduces a new metric R' for multiclass classification tasks, providing both overall and individual evaluations to improve training processes and model selection.

Average precision (AP) and many other related evaluation indices have been employed ubiquitously in classification tasks for a long time. However, they have defects and can hardly provide both overall evaluations and individual evaluations. In practice, we have to strike a balance between whole and individual performances to satisfy diverse demands. To this end, we propose a new index for multiclass classification tasks, named R', which is an unbiased estimator of AP. Specifically, we improve the R index by taking the numerical differences between the real labels and predicted labels of each class into consideration. We evaluate its effectiveness and robustness on the MNIST and CIFAR-10 datasets. Experimental results show that it is positively correlated with some related indices. More importantly, we can obtain both overall and individual evaluations, which can be beneficial for improving training processes and model selection. Furthermore, as an evaluation architecture, the index can be promoted to evaluate any classification task, thereby implying broad application prospects.

Beyond AP: a new evaluation index for multiclass classification task accuracy

Journal

APPLIED INTELLIGENCE

Publisher

SPRINGER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Beyond AP: a new evaluation index for multiclass classification task accuracy

Journal

APPLIED INTELLIGENCE

Publisher

SPRINGER

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper