期刊
IEEE ACCESS
卷 8, 期 -, 页码 62762-62774出版社
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2020.2985255
关键词
Data mining; decision trees; interpretability; splitting criterion
资金
- Basic Science Research Program through the National Research Foundation of Korea (NRF) - Ministry of Education [NRF-2019R1A6A1A03032119]
A new splitting criterion for classification trees that generates better decision rules in terms of interpretability is proposed in this paper. The criterion is designed to find homogeneous rules that describe a significant number of instances with a short length. The proposed criterion considers only one side of a split to generate highly homogeneous rules and concurrently utilizes a function of sample ratios with an adjustable hyperparameter to control the coverage of rules. The distinctive feature of the proposed method is that it is applied adaptively at every split. We also introduce an efficient heuristic algorithm to determine an appropriate hyperparameter value for every split. Experimental results evaluated over 17 benchmark datasets show that the proposed criterion combined with the proposed heuristic constructs a better interpretable decision tree. It is verified through quantitative and qualitative analysis that the constructed tree produces highly interpretable rules, and its predictive performance is comparable to that of other popular criteria.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据