Journal
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES
Volume 34, Issue 6, Pages 3040-3054Publisher
ELSEVIER
DOI: 10.1016/j.jksuci.2020.03.015
Keywords
Decision tree; Music rhythm tree; Class imbalance; Vertical partitioning
Categories
Ask authors/readers for more resources
Decision tree is a widely used non-parametric technique in machine learning, data mining and pattern recognition. This study proposes a novel vertical partitioning technique based on the ideas of music rhythm tree, which shows superior performance in terms of stability and handling of class-imbalanced data.
Decision tree is a widely used non-parametric technique in machine learning, data mining and pattern recognition. It is simple to understand and interpret, however it faces challenges such as handling higher dimensional and class imbalanced datasets, over-fitting and instability. To overcome some of these issues, vertical partitioning approaches like serial partitioning, theme based partitioning are used in the literature. A vertical partitioning approach divides the feature set into subsets of features (blocks) and makes use of these subsets for subsequent tasks. In this work, we use the ideas of music rhythm tree to propose a novel vertical partitioning technique. It orders the features based on the average correlation strength of the features before partitioning the feature set. The proposed method is proved to be superior by showing an average of 13.8%, 6%, 9.8%, 19.7%, 9.4%, and 29.4% higher classification accuracy over C4.5, Random Forest, Bagging, Adaboost, an ensemble technique and a vertical partitioning technique respectively. Our empirical results on 15 datasets demonstrate that the proposed vertical partitioning method is more stable and better in handling class-imbalanced data. Finally, some popular statistical tests are conducted to validate the statistical significance of the results of the proposed method. (C) 2020 The Authors. Published by Elsevier B.V. on behalf of King Saud University.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available