4.2 Article

H-Map-Based Technique for Mining High Average Utility Itemset

Journal

IETE JOURNAL OF RESEARCH
Volume -, Issue -, Pages -

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/03772063.2022.2075800

Keywords

Frequent itemset mining; High average utility itemset mining; High utility itemset mining; Itemset mining; Pruning strategy; Upper bounds

Ask authors/readers for more resources

High Average Utility Itemset mining addresses the limitations of HUIM by taking itemset length into account, improving utility estimation accuracy, enhancing processing efficiency with pruning algorithms, and reducing processing time through a multi-threaded parallel approach.
High Utility Itemset Mining (HUIM) is the process of locating itemsets that are profitable and useful to users. One of the key flaws in HUIM is that as the length of the itemset increases, the utility also increases. The true utility/profit of the itemset is not revealed in HUIM. High Average Utility Itemset mining overcomes the limitations of HUIM by taking the length of the itemset into account when estimating the utility. Existing pruning methods used for eliminating weak candidates overestimate the average usefulness of itemsets, causing the mining process to slow down. To prevent processing unpromising candidate itemsets and efficiently reduce the search space and processing time, the proposed methodology employs Upper Bound using Remaining Items Utility, Maximum Itemset Utility, and Sum of Maximum Utility in a Transaction. It also uses the multithreaded parallel approach to reduce the processing time. The H-Map-based data structure (H-Map) used for storing the utility values reduces the lookup time and joins used for itemset extension compared to existing state-of-the-art High Average Utility Itemset mining algorithms. The performance of the proposed work is evaluated in terms of memory usage and the time taken for processing. The proposed work increases the overall efficiency of the system employing effective pruning algorithms for pruning poor candidate itemsets and an efficient data structure for storing utility values.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available