4.6 Article

Incremental Mining of High Utility Patterns in One Phase by Absence and Legacy-Based Pruning

期刊

IEEE ACCESS
卷 7, 期 -, 页码 74168-74180

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2019.2919524

关键词

Data mining; utility mining; high utility patterns; pattern mining; dynamic databases

资金

  1. Zhejiang Natural Science Foundation of China [LY17F020004]
  2. Key Projects of Zhejiang Science and Technology Plan of China [2018C01084]
  3. First Class Discipline of Zhejiang - A (Zhejiang Gongshang University- Statistics)
  4. National Natural Science Foundation of China [61272306]

向作者/读者索取更多资源

Mining high utility patterns in dynamic databases is an important data mining task. While a naive approach is to mine a newly updated database in its entirety, the state-of-the-art mining algorithms all take an incremental approach. However, the existing incremental algorithms either take a two-phase paradigm that generates a large number of candidates that causes scalability issues or employ a vertical data structure that incurs a large number of join operations that leads to efficiency issues. To address the challenges with the existing incremental algorithms, this paper proposes a new algorithm incremental direct discovery of high utility patterns (Id(2)HUP+). Id(2)HUP+ adapts a one-phase paradigm by improving the relevance-based pruning and upper-bound-based pruning proposes a novel data structure for a quick update of dynamic databases and proposes the absence-based pruning and legacy-based pruning dedicated to incremental mining. The extensive experiments show that our algorithm is up to 1-3 orders of magnitude more efficient than the state-of-the-art algorithms, and is the most scalable algorithm.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据