4.7 Article

FT4cip: A new functional tree for classification in class imbalance problems

Journal

KNOWLEDGE-BASED SYSTEMS
Volume 252, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.knosys.2022.109294

Keywords

Decision trees; Functional trees; Model trees; Supervised classification

Funding

  1. National Council of Science and Technology of Mexico under a CONACYT Scholarship Grant

Ask authors/readers for more resources

Decision trees are popular classifiers known for their good performance, interpretability, and use in ensembles. This paper introduces a new functional tree, FT4cip, designed to address the issue of class imbalance in databases. The experimental results demonstrate that FT4cip outperforms the best model tree and functional tree in terms of AUC. The study also includes meta-analysis and comparison with other models to recommend classifiers based on different database types.
Decision trees (DTs) are popular classifiers partly due to their reasonably good classification per-formance, their ease of interpretation, and their widespread use in ensembles. To improve the classification performance of individual DTs, researchers have used linear combinations of features in inner nodes (Multivariate decision trees), leaf nodes (Model trees), or both (Functional trees). In this paper, we present a new functional tree, Functional Tree for class imbalance problems (FT4cip). FT4cip is designed to work with class imbalance problems, where one of the classes in the database has few objects compared to another class. FT4cip achieves better classification performance, in terms of AUC, than the best model tree (LMT) and functional tree (Gama) that we identified. The statistical comparison was made in 110 databases using Bayesian statistical tests. We also make a meta-analysis of classification performance per type of database, which helps us recommend a classifier given a problem. We show how each design decision taken when building FT4cip contributes to classification performance or simple models, and rank them according to their importance to classification performance. To avoid a problem of fragmentation in DT literature, we contrast each design decision taken when building FT4cip against LMT and Gama.(c) 2022 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available