Article

Music rhythm tree based partitioning approach to decision tree classifier

Publisher

ELSEVIER
DOI: 10.1016/j.jksuci.2020.03.015

Keywords

Decision tree; Music rhythm tree; Class imbalance; Vertical partitioning

The decision tree is a widely used non-parametric technique in machine learning, data mining and pattern recognition. This study proposes a novel vertical partitioning technique based on the idea of a music rhythm tree, which the authors report to be more stable and better at handling class-imbalanced data.
The decision tree is a widely used non-parametric technique in machine learning, data mining and pattern recognition. It is simple to understand and interpret; however, it faces challenges such as handling high-dimensional and class-imbalanced datasets, over-fitting, and instability. To overcome some of these issues, vertical partitioning approaches such as serial partitioning and theme-based partitioning are used in the literature. A vertical partitioning approach divides the feature set into subsets of features (blocks) and uses these subsets for subsequent tasks. In this work, we use the idea of a music rhythm tree to propose a novel vertical partitioning technique. It orders the features by their average correlation strength before partitioning the feature set. The proposed method achieves, on average, 13.8%, 6%, 9.8%, 19.7%, 9.4%, and 29.4% higher classification accuracy than C4.5, Random Forest, Bagging, AdaBoost, an ensemble technique, and a vertical partitioning technique, respectively. Our empirical results on 15 datasets demonstrate that the proposed vertical partitioning method is more stable and handles class-imbalanced data better. Finally, several popular statistical tests are conducted to validate the statistical significance of these results. © 2020 The Authors. Published by Elsevier B.V. on behalf of King Saud University.
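
The two steps described in the abstract (ordering features by average correlation strength, then dividing the ordered feature set into blocks) can be illustrated with a minimal sketch. This is not the authors' algorithm: the abstract does not specify how correlation strength is measured, in which direction features are sorted, or how the music rhythm tree determines the block boundaries. The sketch assumes the mean absolute Pearson correlation between features, a descending sort, and contiguous near-equal blocks. The names `pearson`, `average_correlation_strength`, `order_features`, and `vertical_partition` are hypothetical.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # assumption: a constant feature counts as uncorrelated
    return cov / math.sqrt(vx * vy)


def average_correlation_strength(columns):
    """Mean absolute correlation of each feature with every other feature.

    Assumption: 'correlation strength' is feature-to-feature |Pearson r|.
    Requires at least two features.
    """
    d = len(columns)
    strengths = []
    for i in range(d):
        total = sum(
            abs(pearson(columns[i], columns[j])) for j in range(d) if j != i
        )
        strengths.append(total / (d - 1))
    return strengths


def order_features(columns):
    """Feature indices sorted by descending average correlation strength.

    Assumption: descending order; the abstract does not state the direction.
    """
    strengths = average_correlation_strength(columns)
    return sorted(range(len(columns)), key=lambda i: -strengths[i])


def vertical_partition(order, n_blocks):
    """Split the ordered feature indices into contiguous, near-equal blocks.

    Assumption: equal-size blocks stand in for the rhythm-tree split,
    whose details are not given in the abstract.
    """
    size, extra = divmod(len(order), n_blocks)
    blocks, start = [], 0
    for b in range(n_blocks):
        end = start + size + (1 if b < extra else 0)
        blocks.append(order[start:end])
        start = end
    return blocks


# Synthetic data: features 0-2 share a common signal, feature 3 is pure noise.
rng = random.Random(0)
base = [rng.gauss(0, 1) for _ in range(200)]
columns = [
    base,
    [v + rng.gauss(0, 0.1) for v in base],
    [v + rng.gauss(0, 0.5) for v in base],
    [rng.gauss(0, 1) for _ in range(200)],
]

order = order_features(columns)
blocks = vertical_partition(order, 2)
print("feature order:", order)
print("blocks:", blocks)
```

In a pipeline of this kind, each block would then typically be used to train its own decision tree, with the block-level predictions combined afterwards. How the proposed method combines them is not stated in the abstract.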
