4.7 Article

Maximum Ambiguity-Based Sample Selection in Fuzzy Decision Tree Induction

期刊

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2011.67

关键词

Learning; uncertainty; sample selection; fuzzy decision tree

资金

  1. National Natural Science Foundation of China [61170040]
  2. Natural Science Foundation of Hebei Province [F2008000635, F2012201023]
  3. key project of applied fundamental research of Hebei Province [08963522D]
  4. Education Department in Hebei Province

向作者/读者索取更多资源

Sample selection is to select a number of representative samples from a large database such that a learning algorithm can have a reduced computational cost and an improved learning accuracy. This paper gives a new sample selection mechanism, i.e., the maximum ambiguity-based sample selection in fuzzy decision tree induction. Compared with the existing sample selection methods, this mechanism selects the samples based on the principle of maximal classification ambiguity. The major advantage of this mechanism is that the adjustment of the fuzzy decision tree is minimized when adding selected samples to the training set. This advantage is confirmed via the theoretical analysis of the leaf-nodes' frequency in the decision trees. The decision tree generated from the selected samples usually has a better performance than that from the original database. Furthermore, experimental results show that generalization ability of the tree based on our selection mechanism is far more superior to that based on random selection mechanism.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据