4.6 Article

Finding Stable Periodic-Frequent Itemsets in Big Columnar Databases

Journal

IEEE ACCESS
Volume 11, Issue -, Pages 12504-12524

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2023.3241313

Keywords

Itemsets; Databases; Data mining; Behavioral sciences; Layout; Task analysis; Runtime; Columnar databases; stable periodic-frequent itemset; itemset mining

Ask authors/readers for more resources

Stable periodic-frequent itemset mining is important in big data analytics, and this paper proposes a framework to discover such itemsets in columnar databases. A novel depth-first search algorithm is employed to compress the columnar database into a unified dictionary and recursively mine it to find all stable periodic-frequent itemsets. Experimental results show that the proposed algorithm is computationally efficient and scalable.
Stable periodic-frequent itemset mining is essential in big data analytics with many real-world applications. It involves extracting all itemsets exhibiting stable periodic behaviors in a temporal database. Most previous studies focused on finding these itemsets in row (temporal) databases and disregarded the occurrences of these itemsets in columnar databases. Furthermore, the naive approach of transforming a columnar database into a row database and then applying the existing algorithms to find interesting itemsets is not practicable due to computational reasons. With this motivation, this paper proposes a framework to discover stable periodic-frequent itemsets in columnar databases. Our framework employs a novel depth-first search algorithm that compresses a given columnar database into a unified dictionary and mines it recursively to find all stable periodic-frequent itemsets. The dictionary holds the information pertaining to itemsets and their temporal occurrences in a database. Experimental results on six databases demonstrate that the proposed algorithm is computationally efficient and scalable.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available