Article

Finding representative patterns with ordered projections

Journal

PATTERN RECOGNITION
Volume 36, Issue 4, Pages 1009-1018

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/S0031-3203(02)00119-X

Keywords

data mining; preprocessing techniques; pattern analysis; axis-parallel classifiers

This paper presents a new approach to finding representative patterns for dataset editing. The algorithm, patterns by ordered projections (POP), has several notable characteristics: a substantial reduction in the number of instances in the dataset; a lower computational cost (Θ(mn log n)) than other typical algorithms, owing to the absence of distance calculations; and conservation of the decision boundaries, especially with respect to axis-parallel classifiers. POP works well in practice with both continuous and discrete attributes. The performance of POP is analysed in two ways: percentage of reduction and classification accuracy. POP has been compared with IB2, ENN and SHRINK in terms of percentage of reduction and computational cost. In addition, we have analysed the accuracy of k-NN and C4.5 after applying the reduction techniques. An extensive empirical study using datasets with continuous and discrete attributes from the UCI repository shows that POP is a valuable preprocessing method for the later application of any axis-parallel learning algorithm. © 2002 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
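The abstract does not spell out the procedure, so the following is only a minimal sketch of one way to read "ordered projections", not the authors' published algorithm. It assumes that m is the number of attributes and n the number of instances, that each attribute is sorted independently, and that an instance is dropped only when it lies strictly inside a run of same-class neighbours in every projection. The function name `pop_like_filter`, the `weakness` counter, and the handling of ties and of the two ends of each ordering are all illustrative assumptions.

```python
from typing import Hashable, List, Sequence


def pop_like_filter(X: Sequence[Sequence[float]], y: Sequence[Hashable]) -> List[int]:
    """Return indices kept by a simplified ordered-projection filter.

    Hypothetical sketch: ties in attribute values are broken by input order,
    and the first and last instance of each projection count as borders.
    """
    n = len(X)
    if n == 0:
        return []
    m = len(X[0])
    weakness = [0] * n  # number of projections in which an instance is interior

    for j in range(m):
        # Project onto attribute j and sort: O(n log n), no distance computations.
        order = sorted(range(n), key=lambda i: X[i][j])
        for pos in range(1, n - 1):
            prev_i, i, next_i = order[pos - 1], order[pos], order[pos + 1]
            # Interior of a same-class run: both neighbours share the label.
            if y[prev_i] == y[i] == y[next_i]:
                weakness[i] += 1

    # Keep every instance that sits on a class border in at least one projection.
    return [i for i in range(n) if weakness[i] < m]


# Six points on a diagonal, three per class.
X_demo = [(1, 1), (2, 2), (3, 3), (4, 4), (5, 5), (6, 6)]
y_demo = ["A", "A", "A", "B", "B", "B"]
kept = pop_like_filter(X_demo, y_demo)
print(kept)  # [0, 2, 3, 5]
```

In this toy input, instances 1 and 4 are interior to their class in both projections and are removed. The m sorts of n values give the O(mn log n) cost that matches the bound quoted in the abstract, under the stated reading of m and n.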
