4.5 Article

An algebraic semigroup method for discovering maximal frequent itemsets

Journal

OPEN MATHEMATICS
Volume 20, Issue 1, Pages 1432-1443

Publisher

DE GRUYTER POLAND SP Z O O
DOI: 10.1515/math-2022-0516

Keywords

maximal frequent itemset; association rule mining; generator; semigroup

Categories

Funding

  1. National Natural Science Foundation of China [11701370]

Ask authors/readers for more resources

This article presents a method for discovering maximal frequent itemsets with lower complexity and better performance compared to other algorithms. Additionally, the article explores algebraic properties, provides a recurrence formula, and analyzes the properties of maximal frequent itemsets.
Discovering maximal frequent itemsets is an important issue and key technique in many data mining problems such as association rule mining. In the literature, generating maximal frequent itemsets proves either to be NP-hard or to have O(l(3)4(l)(m + n)) complexity in the worst case from the perspective of generating maximal complete bipartite graphs of a bipartite graph, where m, n are the item number and the transaction number, respectively, and l denotes the maximum of vertical bar C parallel to Psi(C)vertical bar/vertical bar C vertical bar + vertical bar Psi(C)vertical bar - 1), with the maximum taken over all maximal frequent itemsets C. In this article, we put forward a method for discovering maximal frequent itemsets, whose complexity is O(3mn2(beta) + 4(beta)n), lower than the known complexity both in the worst case, from the perspective of semigroup algebra, where beta is the number of items whose support is more than the minimum support threshold. Experiments also show that an algorithm based on the algebraic method performs better than the other three well-known algorithms. Meanwhile, we explore some algebraic properties with respect to items and transactions, prove that the maximal frequent itemsets are exactly the simplified generators of frequent itemsets, give a necessary and sufficient condition for a maximal i + 1-frequent itemset being a subset of a closed i-frequent itemset, and provide a recurrence formula of maximal frequent itemsets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available