☆ 4.2 Article

A New Approximate Method For Mining Frequent Itemsets From Big Data *

COMPUTER SCIENCE AND INFORMATION SYSTEMS (2021)

Journal

COMPUTER SCIENCE AND INFORMATION SYSTEMS

Volume 18, Issue 3, Pages 641-656

Publisher

COMSIS CONSORTIUM

DOI: 10.2298/CSIS200124015V

Keywords

Approximation Method; Frequent Itemsets Mining; Random Sample Partition; Big Transactional Database

Funding

National Natural Science Foundation of China [61972261]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

Frequent itemsets mining is a critical step in finding association rules from transaction databases, and various efficient algorithms have been proposed for this task.

Frequent itemsets mining is the first and most critical step of finding association rules from a transaction database. Association rules mining is one of the main data mining tasks in many applications, such as basket analysis [3], product recommendation [20], crossselling [10], etc. Huge research efforts have been devoted to solving frequent itemsets mining problem. Many of these studies had considerable impact and led to a plenty of sophisticated and efficient algorithms for association rules mining, such as Apriori [1,2], Mining frequent itemsets in transaction databases is an important task in many applications. It becomes more challenging when dealing with a large transaction database because traditional algorithms are not scalable due to the limited main memory. In this paper, we propose a new approach for the approximately mining of frequent itemsets in a big transaction database. Our approach is suitable for mining big transaction databases since it uses the frequent itemsets from a subset of the entire database to approximate the result of the whole data, and can be implemented in a distributed environment. Our algorithm is able to efficiently produce high-accurate results, however it misses some true frequent itemsets. To address this problem and reduce the number of false negative frequent itemsets we introduce an additional parameter to the algorithm to discover most of the frequent itemsets contained in the entire data set. In this article, we show an empirical evaluation of the results of the proposed approach.

A New Approximate Method For Mining Frequent Itemsets From Big Data *

Journal

COMPUTER SCIENCE AND INFORMATION SYSTEMS

Publisher

COMSIS CONSORTIUM

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

A New Approximate Method For Mining Frequent Itemsets From Big Data *

Journal

COMPUTER SCIENCE AND INFORMATION SYSTEMS

Publisher

COMSIS CONSORTIUM

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper