Journal
PATTERN RECOGNITION LETTERS
Volume 60-61, Issue -, Pages 57-64
Publisher
ELSEVIER
DOI: 10.1016/j.patrec.2015.03.002
Keywords
Alternating decision tree; Decision tree; Boosting; Sparse discriminant analysis; Feature selection
Funding
- Monash University Malaysia through a Higher Degree Research scholarship
- Malaysia Ministry of Higher Education Fundamental Research Grant Scheme [FRGS/1/2013/TK02/MUSM/02/1]
Abstract
The alternating decision tree (ADTree) is a special decision tree representation that brings interpretability to boosting, a well-established ensemble algorithm. ADTree has been applied successfully in a wide range of applications. However, existing ADTree variants use univariate decision nodes, which ignore potential interactions between features. To date, there has been no multivariate ADTree. We propose a sparse version of multivariate ADTree, so that the tree remains comprehensible. The proposed sparse ADTree is tested empirically on UCI datasets as well as spectral datasets from the University of Eastern Finland (UEF). We show that sparse ADTree is competitive against both univariate decision trees (original ADTree, C4.5, and CART) and multivariate decision trees (Fisher's decision tree and a single multivariate decision tree from oblique Random Forest). It achieves the best average rank in prediction accuracy and the second-best average rank in decision tree size, and it has a faster induction time than existing ADTree. In addition, it performs especially well on datasets with correlated features, such as the UEF spectral datasets. Thus, the proposed sparse ADTree extends the applicability of ADTree to a wider variety of applications. © 2015 Elsevier B.V. All rights reserved.
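To make the contrast between univariate and multivariate decision nodes concrete, here is a minimal Python sketch of how an ADTree scores an instance: the prediction values of every reached prediction node are summed, and the sign of the sum gives the class. This is not the induction algorithm from the paper. The class names (`Prediction`, `Splitter`), the toy tree, and all weights and thresholds are hypothetical and hand-picked for illustration. A splitter stores its weights as a sparse mapping from feature index to weight, so a univariate node is simply the special case with one nonzero weight.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Sequence


@dataclass
class Prediction:
    """Prediction node: contributes `value` to the score when reached."""
    value: float
    splitters: List["Splitter"] = field(default_factory=list)


@dataclass
class Splitter:
    """Decision node testing a sparse linear combination of features.

    `weights` maps feature index -> weight; features not listed have weight 0.
    """
    weights: Dict[int, float]
    threshold: float
    if_true: Prediction
    if_false: Prediction

    def test(self, x: Sequence[float]) -> bool:
        return sum(w * x[i] for i, w in self.weights.items()) > self.threshold


def score(node: Prediction, x: Sequence[float]) -> float:
    """Sum the prediction values on all paths the instance follows."""
    total = node.value
    # Every splitter attached to a reached prediction node is evaluated.
    for s in node.splitters:
        child = s.if_true if s.test(x) else s.if_false
        total += score(child, x)
    return total


def classify(root: Prediction, x: Sequence[float]) -> int:
    """Return +1 or -1 from the sign of the summed score."""
    return 1 if score(root, x) >= 0 else -1


# Hypothetical toy tree (values chosen by hand, not learned).
root = Prediction(
    0.5,
    [
        # Multivariate node: uses features 0 and 2 only (sparse).
        Splitter({0: 1.0, 2: -1.0}, 0.0, Prediction(1.0), Prediction(-2.0)),
        # Univariate node: a single nonzero weight on feature 1.
        Splitter({1: 1.0}, 2.0, Prediction(-0.5), Prediction(0.25)),
    ],
)

x_a = [3.0, 1.0, 1.0, 9.0]
x_b = [0.0, 5.0, 2.0, 0.0]
print(score(root, x_a), classify(root, x_a))
print(score(root, x_b), classify(root, x_b))
```

Because each splitter lists only the features it uses, the tree stays readable even when a node combines several features, which is the motivation the abstract gives for sparsity.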