期刊
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE
卷 106, 期 -, 页码 -出版社
PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.engappai.2021.104477
关键词
Data mining; Frequent pattern mining; Uncertain frequent pattern mining; Uncertain data; Expected support
This study proposes an efficient method based on an upper bound approach to mine uncertain frequent patterns, reducing false positives significantly by tightening the upper bound of expected support and early pruning of infrequent 2-itemsets and their supersets.
In recent years, frequent pattern mining from uncertain data has been actively researched in data mining. There are numerous exact and upper bound-based approaches for uncertain frequent pattern mining. Exact-based algorithms may produce a large data structure and need time-consuming calculations and upper bound-based algorithms may produce many false positives. As a result, these algorithms demand much time and memory. There have been efforts to resolve the problem of upper bound-based algorithms, however, all of these methods only try to tighten the upper bound of expected support for long patterns. This is while pruning infrequent short patterns has a greater impact on reducing the false positives. To overcome these drawbacks, in this paper an efficient method based on upper bound is proposed for mining uncertain frequent patterns. The proposed method uses a new Tightened upper bound to expected support of patterns (Tup) which has a significant effect on reducing the number of false positives by tightening the upper bound of expected support and early pruning of infrequent 2-itemsets and their supersets. Comprehensive experimental results show that the proposed method reduces memory consumption in most cases and dramatically improves the performance of exact and upper bound-based methods in terms of runtime and scalability for dense and sparse uncertain data.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据