4.7 Article

MWFP-outlier: Maximal weighted frequent-pattern-based approach for detecting outliers from uncertain weighted data streams

Journal

INFORMATION SCIENCES
Volume 591, Issue -, Pages 195-225

Publisher

ELSEVIER SCIENCE INC
DOI: 10.1016/j.ins.2022.01.028

Keywords

Outlier detection; Maximal weighted frequent patterns; Uncertain weighted data streams; Deviation indices; Data mining

Funding

  1. National Key R&D Program of China [2020YFB1005500]
  2. National Nat-ural Science Foundation of China (NSFC) [U1836116, 62172194, 62102168]
  3. China PostdoctoralScience Foundation [2021M691310]
  4. Postdoctoral Science Foundation of Jiangsu Province [2021K636C, 2021K596C]
  5. Natural Science Foundation of the Jiangsu Higher Education Institutions [21KJB520031, 20KJB520031]
  6. Future Network Scientific Research Fund Project [FNSRFP-2021-YB-50]
  7. Leading-edge Technology Program of Jiangsu Natural Science Foundation [BK20202001]
  8. Natural Science Foundation of Jiangsu Province [BK20200888]

Ask authors/readers for more resources

The study proposes a new outlier detection approach that improves accuracy in handling uncertain data streams and effectively detects potential outliers. Through two phases, namely pattern mining and outlier detection phases, more accurate outlier detection is achieved. Experimental results demonstrate that this method is not only accurate but also consumes less time.
Many outlier detection approaches have been proposed for identifying previously unknown outliers, therefore improving the credibility of data. However, previous outlier detection approaches have some problems. First, most approaches were designed for static precise datasets, thus, their detection accuracy is very low when processing uncertain data streams. Second, these approaches considered the importance (aka weight) of each pattern is the same, which could not accurately reflect some actual situations in real life. To solve these problems, we propose an efficient maximal weighted frequent-pattern-based outlier detection approach, called MWFP-Outlier, for accurately detecting potential outliers from uncertain data streams through two phases, namely pattern mining phase and an outlier detection phase. In the pattern mining phase, through fully considering the existential probabilities and weights for each pattern, we propose the MWFP-Mine approach to accurately and efficiently mine maximal weighted frequent patterns based on the designed tree structure, list structure, and pruning strategies. In the outlier detection phase, we design four deviation indices to accurately measure the deviation degree of each transaction, and then the transactions in the top k ranked are identified as potential outliers. Extensive experimental results demonstrate that the MWFP-Outlier approach can accurately detect the outliers from uncertain weighted data streams, as well as uses less time consumption.(c) 2022 Elsevier Inc. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available