4.7 Article

ISSP-tree: An improved fast algorithm for constructing a complete prefix tree using single database scan

Journal

EXPERT SYSTEMS WITH APPLICATIONS
Volume 185

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2021.115603

Keywords

Association rule; Frequent pattern; Incremental mining; Frequent itemset; SSP-tree; ISSP-tree; FP-tree; Rule mining; Data mining

Funding

  1. Tezpur University, India
  2. Maulana Azad National Fellowship (MANF), UGC

Frequent pattern mining has mostly been studied under two assumptions: that all the information to be processed fits in the system's main memory, and that databases are static. This paper proposes an efficient tree data structure called ISSP-tree that can handle updated databases and is adaptive to incremental and interactive mining using only one database scan.
Researchers have explored the frequent pattern mining problem under the assumptions that the complete set of information to be processed can be accommodated in the system's main memory and that databases are static. However, in real-life scenarios any transactional or online database may be modified as new transactions are added or obsolete records are deleted. Moreover, the support threshold may be updated over time to generate a new set of frequent patterns from the updated database. A straightforward but inefficient way to deal with this problem is to recompute the full set of patterns whenever the database or the support threshold changes. Most existing algorithms mine patterns using multiple database scans, which requires a massive amount of main memory and computational time to retain candidate itemsets and prune the unnecessary ones. The research community has developed a few methods that handle the incremental scenario without re-computation from scratch, and those methods are efficient in terms of the number of database scans. Although these approaches solve the re-computation problem by constructing a complete pattern-tree data structure using only one database scan, they have significant issues such as massive disk I/O, a colossal search space, and high tree construction time. Therefore, to improve the tree construction time, we propose an efficient tree data structure called ISSP-tree (Improved Single Scan Pattern Tree), which creates a complete tree that retains all the database transactions, irrespective of item frequencies, using only one database scan. Moreover, the method is also adaptive to incremental and interactive mining.
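
The general idea of a complete, single-scan prefix tree can be illustrated with a minimal Python sketch. This is not the ISSP-tree algorithm itself, whose construction details are not given in the abstract; it assumes a plain prefix tree over lexicographically sorted items, and the names `Node`, `PrefixTree`, `insert`, `delete`, and `support` are hypothetical. Because no item is pruned by frequency, transactions can be added or removed later, and support can be queried under any threshold without rescanning the database.

```python
class Node:
    """Tree node: an item, the number of transactions passing through it,
    and the number of transactions that end exactly here."""

    def __init__(self, item=None):
        self.item = item
        self.count = 0
        self.end = 0
        self.children = {}


class PrefixTree:
    """Complete prefix tree: every transaction is stored, frequent or not.
    Assumption: items are kept in lexicographic order along each path."""

    def __init__(self):
        self.root = Node()
        self.size = 0  # number of stored transactions

    @staticmethod
    def _canonical(transaction):
        return sorted(set(transaction))

    def insert(self, transaction):
        """Add one transaction; each database record is read only once."""
        node = self.root
        for item in self._canonical(transaction):
            child = node.children.get(item)
            if child is None:
                child = node.children[item] = Node(item)
            child.count += 1
            node = child
        node.end += 1
        self.size += 1

    def delete(self, transaction):
        """Remove one previously inserted transaction (incremental update)."""
        path = [self.root]
        for item in self._canonical(transaction):
            child = path[-1].children.get(item)
            if child is None:
                raise KeyError("transaction not stored")
            path.append(child)
        if path[-1].end == 0:
            raise KeyError("transaction not stored")
        path[-1].end -= 1
        self.size -= 1
        for parent, child in zip(path, path[1:]):
            child.count -= 1
            if child.count == 0:
                # No other transaction uses this branch, so drop it whole.
                del parent.children[child.item]
                break

    def support(self, itemset):
        """Count the stored transactions that contain every item in itemset."""
        target = self._canonical(itemset)
        if not target:
            return self.size

        def walk(node, i):
            total = 0
            for item, child in node.children.items():
                if item == target[i]:
                    last = i == len(target) - 1
                    total += child.count if last else walk(child, i + 1)
                elif item < target[i]:
                    # Smaller items may still precede target[i] on this path.
                    total += walk(child, i)
                # Larger items: target[i] cannot appear deeper on a sorted path.
            return total

        return walk(self.root, 0)
```

In this sketch, changing the support threshold only changes how the result of `support` is compared (for example `tree.support(["a", "b"]) >= min_count`); the tree itself is never rebuilt.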
