Journal
DATA MINING AND KNOWLEDGE DISCOVERY
Volume 34, Issue 1, Pages 101-123Publisher
SPRINGER
DOI: 10.1007/s10618-019-00657-9
Keywords
Combinatorial Boolean model; Sparse learning; Knowledge discovery; Frequent itemset mining
Funding
- JST KAKENHI [19H01114]
- JST-AIP [JPMJCR19U4]
Ask authors/readers for more resources
We consider the class of linear predictors over all logical conjunctions of binary attributes, which we refer to as the class of combinatorial binary models (CBMs) in this paper. CBMs are of high knowledge interpretability but naive learning of them from labeled data requires exponentially high computational cost with respect to the length of the conjunctions. On the other hand, in the case of large-scale datasets, long conjunctions are effective for learning predictors. To overcome this computational difficulty, we propose an algorithm, GRAfting for Binary datasets (GRAB), which efficiently learns CBMs within the regularized loss minimization framework. The key idea of GRAB is to adopt weighted frequent itemset mining for the most time-consuming step in the grafting algorithm, which is designed to solve large-scale RERM problems by an iterative approach. Furthermore, we experimentally showed that linear predictors of CBMs are effective in terms of prediction accuracy and knowledge discovery.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available