4.5 Article

An improved data characterization method and its application in classification algorithm recommendation

Journal

APPLIED INTELLIGENCE
Volume 43, Issue 4, Pages 892-912

Publisher

SPRINGER
DOI: 10.1007/s10489-015-0689-3

Keywords

Classification algorithm recommendation; Classification; Data set characteristics extraction

Funding

  1. China Postdoctoral Science Foundation [2014M562417]
  2. National Natural Science Foundation of China [61402355]

Ask authors/readers for more resources

Picking up appropriate classification algorithms for a given data set is very important and useful in practice. One of the most challenging issues for algorithm selection is how to characterize different data sets. Recently, we extracted the structural information of a data set to characterize itself. Although these kinds of characteristics work well in identifying similar data sets and recommending appropriate classification algorithms, the extraction method can only be applied to binary data sets and its performance is not high. Thus, in this paper, an improved data set characterization method is proposed to address these problems. For the purpose of evaluating the effectiveness of the improved method on algorithm recommendation, the unsupervised learning method EM is employed to build the algorithm recommendation model. Extensive experiments with 17 different types of classification algorithms are conducted upon 84 public UCI data sets; the results demonstrate the effectiveness of the proposed method.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available