4.7 Article

A Theory of Evidence-based method for assessing frequent patterns

Journal

EXPERT SYSTEMS WITH APPLICATIONS
Volume 40, Issue 8, Pages 3121-3127

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.eswa.2012.12.030

Keywords

Frequent itemset mining; Theory of Evidence; Information measures; Uncertainty management

Funding

  1. Spanish Ministry of Science and Innovation [TIN2009-14372-C03-01]
  2. Junta de Andalucia (Andalusia Regional Government) Excellence Project [P07-SEJ-03214]

Ask authors/readers for more resources

Frequent itemset (or frequent pattern) mining is a very important issue within the data mining field. Both, syntactic simplicity and descriptive potential, are the key features of the itemset-based pattern which have led to its widespread use in a growing number of real-life domains. Some of the most representative algorithms for mining this kind of pattern are Apriori-like algorithms and, therefore, the number of patterns obtained under normal conditions is very large, making the process of evaluation and interpretation quite difficult. This problem is compounded if we consider that knowledge discovery is an iterative process, and the change in the parameters of the preprocessing techniques or the mining algorithm can lead to significant changes in the result. In this paper, we propose a method based on Shafer's Theory of Evidence which uses two information measures for the quality evaluation of the set of frequent patterns. From a practical point of view, the main goal is to select, for a given database, the best preprocessing technique that lead to the discovery of useful knowledge. Nevertheless, the underlying idea is to propose a formal method to assess, objectively, sets of frequent patterns, seen as belief structures, in terms of certainty in the information they represent. (C) 2012 Elsevier Ltd. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available