☆ 4.5 Article

Beyond AP: a new evaluation index for multiclass classification task accuracy

APPLIED INTELLIGENCE (2021)

期刊

APPLIED INTELLIGENCE

卷 51, 期 10, 页码 7166-7176

出版社

SPRINGER

DOI: 10.1007/s10489-021-02223-7

关键词

Machine learning; Multiclassification; Evaluation index; R ' method

类别

Computer Science, Artificial Intelligence

资金

National Key Research and Development Program of China [2018YFB0204301]
Open Fund of PDL [6142110190201]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

The study introduces a new metric R' for multiclass classification tasks, providing both overall and individual evaluations to improve training processes and model selection.

Average precision (AP) and many other related evaluation indices have been employed ubiquitously in classification tasks for a long time. However, they have defects and can hardly provide both overall evaluations and individual evaluations. In practice, we have to strike a balance between whole and individual performances to satisfy diverse demands. To this end, we propose a new index for multiclass classification tasks, named R', which is an unbiased estimator of AP. Specifically, we improve the R index by taking the numerical differences between the real labels and predicted labels of each class into consideration. We evaluate its effectiveness and robustness on the MNIST and CIFAR-10 datasets. Experimental results show that it is positively correlated with some related indices. More importantly, we can obtain both overall and individual evaluations, which can be beneficial for improving training processes and model selection. Furthermore, as an evaluation architecture, the index can be promoted to evaluate any classification task, thereby implying broad application prospects.

Beyond AP: a new evaluation index for multiclass classification task accuracy

期刊

APPLIED INTELLIGENCE

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Beyond AP: a new evaluation index for multiclass classification task accuracy

期刊

APPLIED INTELLIGENCE

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文