Multi-Objective Optimization for High-Dimensional Maximal Frequent Itemset Mining

Journal

APPLIED SCIENCES-BASEL
Volume 11, Issue 19, Pages -

Publisher

MDPI
DOI: 10.3390/app11198971

Keywords

association rules; frequent itemset mining; big data; multi-objective optimization; maximal frequent itemset

Funding

  1. Zhejiang Basic Public Welfare Research Plan Projects - Science Technology Department of Zhejiang Province, China [LGG19F030009, LGG19F020005]

A multi-objective optimization algorithm was proposed for mining frequent itemsets in high-dimensional data. It searched for maximal frequent itemsets and allowed individuals to recover by recording and eliminating lethal gene fragments. The final output was the set of Pareto optimal solutions for frequent itemsets.
Because big data has high-dimensional attributes, the solution space of frequent itemsets generally grows exponentially. Yet mining the frequent itemsets of high-dimensional transaction sets is the prerequisite for association rule analysis on big data. Classical algorithms such as Apriori and FP-Growth, as well as their derivatives, are impractical for big data analysis in such an explosive solution space because of their heavy consumption of storage space and running time. A multi-objective optimization algorithm was proposed to mine the frequent itemsets of high-dimensional data. First, all frequent 2-itemsets were generated by scanning the transaction sets; new items were then added to them, and the resulting itemsets served as the objects of population evolution. The algorithm searched for maximal frequent itemsets in order to cover as many frequent itemsets as possible, because every non-empty subset of a frequent itemset is itself frequent. During the run, lethal gene fragments in individuals were recorded and eliminated so that the individuals could be revived. The result was a set of Pareto optimal solutions for the frequent itemsets: all non-empty subsets of these solutions were frequent itemsets, and all of their proper supersets were infrequent. Experiments demonstrated the practicability and validity of the proposed algorithm on big data.
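
The two building blocks named in the abstract, frequent 2-itemsets and maximal frequent itemsets, can be illustrated with a small brute-force sketch. This is not the paper's evolutionary algorithm: the function names, the toy transactions, and the absolute support threshold `min_count` are all assumptions made for illustration, and the exhaustive checks shown here would not scale to high-dimensional data.

```python
from collections import Counter
from itertools import combinations


def support(itemset, transactions):
    """Count the transactions that contain every item of `itemset`."""
    return sum(1 for t in transactions if itemset <= t)


def frequent_pairs(transactions, min_count):
    """Return all 2-itemsets occurring in at least `min_count` transactions."""
    counts = Counter()
    for t in transactions:
        # One scan: count every pair of items that co-occurs in a transaction.
        for pair in combinations(sorted(t), 2):
            counts[frozenset(pair)] += 1
    return {p for p, c in counts.items() if c >= min_count}


def is_maximal_frequent(itemset, transactions, min_count):
    """True if `itemset` is frequent and no one-item extension is frequent.

    Checking one-item extensions suffices: any larger frequent superset
    would contain a frequent one-item extension (downward closure).
    """
    if support(itemset, transactions) < min_count:
        return False
    all_items = set().union(*transactions)
    return all(
        support(itemset | {item}, transactions) < min_count
        for item in all_items - itemset
    )


# Hypothetical toy data, not taken from the paper.
transactions = [
    frozenset(t)
    for t in ({"a", "b", "c"}, {"a", "b", "c"}, {"a", "b", "d"}, {"b", "d"}, {"c", "d"})
]

pairs = frequent_pairs(transactions, min_count=2)
print(sorted("".join(sorted(p)) for p in pairs))
print(is_maximal_frequent(frozenset("abc"), transactions, 2))
print(is_maximal_frequent(frozenset("ab"), transactions, 2))
```

With this toy data, `{a, b, c}` is maximal because adding `d` drops its support to zero, whereas `{a, b}` is frequent but not maximal because its superset `{a, b, c}` is still frequent.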
