☆ 4.5 Article

(Psycho-)analysis of benchmark experiments: A formal framework for investigating the relationship between data sets and learning algorithms

COMPUTATIONAL STATISTICS & DATA ANALYSIS (2014)

期刊

COMPUTATIONAL STATISTICS & DATA ANALYSIS

卷 71, 期 -, 页码 986-1000

出版社

ELSEVIER

DOI: 10.1016/j.csda.2013.08.007

关键词

Benchmark experiments; Data set characterization; Recursive partitioning; Preference scaling; Bradley-Terry model

类别

Computer Science, Interdisciplinary Applications Statistics & Probability

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

It is common knowledge that the performance of different learning algorithms depends on certain characteristics of the data such as dimensionality, linear separability or sample size. However, formally investigating this relationship in an objective and reproducible way is not trivial. A new formal framework for describing the relationship between data set characteristics and the performance of different learning algorithms is proposed. The framework combines the advantages of benchmark experiments with the formal description of data set characteristics by means of statistical and information-theoretic measures and with the recursive partitioning of Bradley-Terry models for comparing the algorithms' performances. The formal aspects of each component are introduced and illustrated by means of an artificial example. Its real-world usage is demonstrated with an application example consisting of thirteen widely-used data sets and six common learning algorithms. The Appendix provides information on the implementation and the usage of the framework within the R language. (C) 2013 Elsevier B.V. All rights reserved.

(Psycho-)analysis of benchmark experiments: A formal framework for investigating the relationship between data sets and learning algorithms

期刊

COMPUTATIONAL STATISTICS & DATA ANALYSIS

出版社

ELSEVIER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

(Psycho-)analysis of benchmark experiments: A formal framework for investigating the relationship between data sets and learning algorithms

期刊

COMPUTATIONAL STATISTICS & DATA ANALYSIS

出版社

ELSEVIER

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文