4.6 Article

Finding Stable Periodic-Frequent Itemsets in Big Columnar Databases

期刊

IEEE ACCESS
卷 11, 期 -, 页码 12504-12524

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2023.3241313

关键词

Itemsets; Databases; Data mining; Behavioral sciences; Layout; Task analysis; Runtime; Columnar databases; stable periodic-frequent itemset; itemset mining

向作者/读者索取更多资源

Stable periodic-frequent itemset mining is important in big data analytics, and this paper proposes a framework to discover such itemsets in columnar databases. A novel depth-first search algorithm is employed to compress the columnar database into a unified dictionary and recursively mine it to find all stable periodic-frequent itemsets. Experimental results show that the proposed algorithm is computationally efficient and scalable.
Stable periodic-frequent itemset mining is essential in big data analytics with many real-world applications. It involves extracting all itemsets exhibiting stable periodic behaviors in a temporal database. Most previous studies focused on finding these itemsets in row (temporal) databases and disregarded the occurrences of these itemsets in columnar databases. Furthermore, the naive approach of transforming a columnar database into a row database and then applying the existing algorithms to find interesting itemsets is not practicable due to computational reasons. With this motivation, this paper proposes a framework to discover stable periodic-frequent itemsets in columnar databases. Our framework employs a novel depth-first search algorithm that compresses a given columnar database into a unified dictionary and mines it recursively to find all stable periodic-frequent itemsets. The dictionary holds the information pertaining to itemsets and their temporal occurrences in a database. Experimental results on six databases demonstrate that the proposed algorithm is computationally efficient and scalable.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据