4.7 Article

An improved decision tree algorithm based on variable precision neighborhood similarity

期刊

INFORMATION SCIENCES
卷 615, 期 -, 页码 152-166

出版社

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2022.10.043

关键词

Decision tree algorithm; Neighborhood geometry similarity; Neighborhood algebraic similarity; Equivalence relation; Variable precision neighborhood rough set; Attribute dependence

资金

  1. National Natural Science Foundation of China [62166001, 61976158]
  2. Natural Science Foundation of Jiangxi Province, China [20202BAB202010]
  3. Graduate Innovation Funding Program of Gannan Normal University, China [YCX22A025]
  4. Project of Science and Technology of Education Department of Jiangxi Province [GJJ211407]

向作者/读者索取更多资源

The decision tree algorithm is widely used in data mining and machine learning due to its accuracy, low computational cost, and interpretability. However, when dealing with continuous data, the classical decision tree algorithm needs to discretize continuous attributes, which may result in a loss of information structure and affect the classification performance. To address this issue, this paper proposes a new decision tree algorithm based on variable precision neighborhood rough sets, considering the geometric structure of neighborhood systems to solve the contradiction in the transitivity of the equivalence relation.
The decision tree algorithm has been widely used in data mining and machine learning due to its high accuracy, low computational cost and high interpretability. However, when dealing with the continuous data, the classical decision tree algorithm needs to replace continuous attributes with discretized attributes by the strategy of discretization. Discretization may cause a loss of information structure, which will affect the performance of classification. To tackle this problem, many researchers have proposed different decision tree methods based on variable precision neighborhood rough sets. However, these methods do not consider the geometric structure of neighborhood systems, which may lead to a contradiction in the transitivity of the equivalence relation. In this paper, we first define a novel neighborhood geometric similarity in a neighborhood system from the perspective of geometry. Second, by combining the neighborhood geometric similarity and the neighborhood algebraic similarity, we propose four new kinds of neighborhood similarities, which can solve the contradictory transitivity of the equivalence relation. Third, a variable precision neighborhood rough set model is constructed using the new similarities, and a novel decision tree algorithm is proposed based on this model, where the degree of attribute dependence is used as the partition measure. Experimental results on 14 selected datasets from the UCI Machine Learning Repository show that our algorithm is effective. The average accuracy of our algorithm is over 90%, which is 10% higher than the classical decision tree algorithms, and the number of leaf nodes increases slightly. (c) 2022 Elsevier Inc. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据