Article

Indexed list-based high utility pattern mining with utility upper-bound reduction and pattern combination techniques

Journal

KNOWLEDGE AND INFORMATION SYSTEMS
Volume 51, Issue 2, Pages 627-659

Publisher

SPRINGER LONDON LTD
DOI: 10.1007/s10115-016-0989-x

Keywords

High utility patterns; Indexed list; Pattern mining; Utility mining

Funding

  1. National Research Foundation of Korea (NRF) - Ministry of Education, Science and Technology (NRF) [20152062051, 20155054624]
  2. National Research Foundation of Korea [2015R1D1A1A02062051, 2015R1A2A1A15054624] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

High utility pattern mining has been studied as an essential topic in the field of pattern mining because many real-world applications, such as market analysis, need to process non-binary databases in which items carry different importance. In this paper, we propose an efficient algorithm with a novel indexed list-based data structure for mining high utility patterns. Previous approaches first generate an enormous number of candidate patterns using overestimation methods and then identify the actual high utility patterns among the candidates through an additional database scan, which leads to high computational overhead. Although several list-based algorithms that discover high utility patterns without candidate generation have been suggested in recent years, they require a large number of comparison operations. Our method mines high utility patterns efficiently by using the proposed indexed list to reduce the total number of such operations. Moreover, we develop two techniques based on this novel data structure to further enhance the mining performance of the proposed method. Experimental results on real and synthetic datasets show that the proposed algorithm mines high utility patterns more efficiently than state-of-the-art algorithms.
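To make the problem setting concrete, the following is a minimal sketch of what "utility" and "overestimation" mean in high utility pattern mining. It is not the indexed list algorithm proposed in the paper. It is a brute-force baseline on a toy database, and every name in it (`transactions`, `profit`, `pattern_utility`, `twu`, `mine_brute_force`) and all data values are hypothetical, chosen only for illustration. The transaction-weighted utility (TWU) shown here is the standard overestimation measure from the utility mining literature and is assumed to be the kind of upper bound the abstract refers to.

```python
from itertools import combinations

# Hypothetical toy database: each transaction maps item -> purchased quantity.
transactions = [
    {"a": 1, "b": 2, "c": 1},
    {"a": 2, "c": 3},
    {"b": 1, "c": 2, "d": 1},
]
# Hypothetical per-unit profit (item importance) of each item.
profit = {"a": 5, "b": 2, "c": 1, "d": 10}


def transaction_utility(tx):
    """Total utility of one transaction: sum of quantity * profit."""
    return sum(qty * profit[item] for item, qty in tx.items())


def pattern_utility(pattern, db):
    """Exact utility of a pattern: summed over transactions containing all its items."""
    total = 0
    for tx in db:
        if all(item in tx for item in pattern):
            total += sum(tx[item] * profit[item] for item in pattern)
    return total


def twu(pattern, db):
    """Transaction-weighted utility: an upper bound on the utility of the
    pattern and of all its supersets."""
    return sum(
        transaction_utility(tx)
        for tx in db
        if all(item in tx for item in pattern)
    )


def mine_brute_force(db, min_util):
    """Enumerate every itemset and keep those whose exact utility >= min_util.
    Exponential in the number of items; for illustration only."""
    items = sorted({item for tx in db for item in tx})
    result = {}
    for size in range(1, len(items) + 1):
        for pattern in combinations(items, size):
            utility = pattern_utility(pattern, db)
            if utility >= min_util:
                result[pattern] = utility
    return result


high_utility = mine_brute_force(transactions, min_util=12)
```

With this toy data, the item `b` alone has utility 6, below the threshold of 12, yet the pattern `("b", "d")` reaches 12. Utility is therefore not anti-monotone, which is why miners prune with an overestimate such as TWU (here `twu(("b",), transactions)` is 24) rather than with the exact utility, at the cost of producing candidates that must be verified later.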
