Article

An efficient hash map based technique for mining high average utility itemset

Journal

Publisher

SPRINGER INDIA
DOI: 10.1007/s12046-022-01997-x

Keywords

High utility itemset mining; high average utility itemset mining; upper bounds; pruning strategy

HAUI mining improves upon HUI mining by scoring itemsets with average utility, which removes the bias toward longer itemsets. The proposed hash map based algorithm uses tighter pruning bounds to reduce processing time and memory usage.
High Average Utility Itemset (HAUI) mining is an improvement on High-Utility Itemset (HUI) mining, which is widely used in various pattern mining applications. A key flaw in HUI mining is that the utility measure grows with the length of the itemset. HAUI mining addresses this by relating the utility of an itemset to its length through an unbiased measure termed average utility. Existing pruning bounds used to eliminate weak candidates, such as the average-utility upper bound, the revised tighter upper bound, and the looser upper bound, overestimate the average utility of itemsets, which slows down the mining process. In the proposed methodology, Upper Bound using Remaining Items Utility (UBRIU), Maximum Itemset Utility (MIU), and Sum of Maximum Utility in a Transaction (SMUT) are used to avoid processing unpromising candidate itemsets, which shrinks the search space and therefore the processing time. The UBRIU value is used to check whether an itemset can be extended. The key-value mapping structure used for storing the utility values reduces lookup time compared to the existing IL and IDUL structures. The performance of the proposed work is evaluated in terms of memory usage and processing time. According to the experimental results, the proposed algorithm is significantly faster than existing state-of-the-art HAUI mining algorithms and uses significantly less memory. The proposed work increases overall efficiency by combining effective pruning strategies for discarding poor candidate itemsets with an efficient data structure for storing utility values.
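
To make the average utility measure and the key-value storage idea concrete, here is a minimal Python sketch. It is not the paper's algorithm: the toy database, the function names, and the nested-dictionary layout are all assumptions made for illustration. It enumerates candidates by brute force and does not implement UBRIU, MIU, or SMUT, whose definitions the abstract does not give. It only shows the standard definition of average utility (itemset utility divided by itemset length) computed from a hash map of per-transaction item utilities.

```python
from itertools import combinations

# Hypothetical toy database: each transaction maps item -> utility of that
# item in the transaction (for example, quantity times unit profit).
transactions = [
    {"a": 5, "b": 2, "c": 1},
    {"a": 10, "c": 6},
    {"b": 4, "c": 3, "d": 8},
]


def build_utility_index(db):
    """Build a hash map: item -> {transaction id: utility of item there}."""
    index = {}
    for tid, tx in enumerate(db):
        for item, util in tx.items():
            index.setdefault(item, {})[tid] = util
    return index


def utility(itemset, index):
    """Sum the utilities of the itemset over transactions containing all of it."""
    items = list(itemset)
    tids = set(index.get(items[0], {}))
    for item in items[1:]:
        tids.intersection_update(index.get(item, {}))
    return sum(index[item][tid] for tid in tids for item in items)


def average_utility(itemset, index):
    """Average utility: itemset utility divided by the number of items."""
    return utility(itemset, index) / len(itemset)


def mine_hauis(db, min_avg_utility):
    """Brute-force enumeration (no pruning) of high average utility itemsets."""
    index = build_utility_index(db)
    items = sorted(index)
    result = {}
    for k in range(1, len(items) + 1):
        for itemset in combinations(items, k):
            au = average_utility(itemset, index)
            if au >= min_avg_utility:
                result[itemset] = au
    return result


if __name__ == "__main__":
    for itemset, au in mine_hauis(transactions, min_avg_utility=8).items():
        print(itemset, au)
```

In this toy data the itemset (a, c) has utility 22 but average utility 11, so dividing by length keeps longer itemsets from qualifying merely because they contain more items. A real miner would replace the exhaustive loop with upper-bound pruning of the kind the abstract describes.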
