4.5 Article

An efficient utility-list based high-utility itemset mining algorithm

Journal

APPLIED INTELLIGENCE
Volume 53, Issue 6, Pages 6992-7006

Publisher

SPRINGER
DOI: 10.1007/s10489-022-03850-4

Keywords

Data mining; Pattern mining; High-utility itemset mining; Simplified utility-list

Ask authors/readers for more resources

High-utility itemset mining is an important task in data mining for retrieving meaningful patterns. Existing algorithms suffer from storage and time overheads. To address this, we propose an efficient algorithm based on simplified utility-list structure, which effectively reduces the number of candidates, memory usage, and execution time by introducing techniques like simplified utility-list, repeated pruning, and extension utility.
High-utility itemset mining (HUIM) is an important task in data mining that can retrieve more meaningful and useful patterns for decision-making. One-phase HUIM algorithms based on the utility-list structure have been shown to be the most efficient as they can mine high-utility itemsets (HUIs) without generating candidates. However, storing itemset information for the utility-list is time-consuming and memory consuming. To address this problem, we propose an efficient simplified utility-list-based HUIM algorithm (HUIM-SU). In the proposed HUIM-SU algorithm, the simplified utility-list is proposed to obtain all HUIs effectively and reduce memory usage in the depth-first search process. Based on the the simplified utility-list, repeated pruning according to the transaction-weighted utilisation (TWU) reduces the number of items. In addition, a construction tree and compressed storage are introduced to further reduce the search space and the memory usage. The extension utility and itemset TWU are then proposed to be the upper bounds, which reduce the search space considerably. Extensive experimental results on dense and sparse datasets indicate that the proposed HUIM-SU algorithm is highly efficient in terms of the number of candidates, memory usage, and execution time.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available