4.7 Article

A relevant subspace based contextual outlier mining algorithm

期刊

KNOWLEDGE-BASED SYSTEMS
卷 99, 期 -, 页码 1-9

出版社

ELSEVIER SCIENCE BV
DOI: 10.1016/j.knosys.2016.01.013

关键词

Contextual outlier; Relevant subspace; Interpretability and comprehensibility; Local sparsity; Probability density

资金

  1. National Natural Science Foundation of PR China [61572343]
  2. U.S. National Science Foundation [CCF-0845257]

向作者/读者索取更多资源

For high-dimensional and massive data sets, a relevant subspace based contextual outlier detection algorithm is proposed. Firstly, the relevant subspace, which can effectively describe the local distribution of the various data sets, is redefined by using local sparseness of attribute dimensions. Secondly, a local outlier factor calculation formula in the relevant subspace is defined with probability density of local data sets, and the formula can effectively reflect the outlier degree of data object that does not obey the distribution of the local data set in the relevant subspace. Thirdly, attribute dimensions of constituting the relevant subspace and local outlier factor are defined as the contextual information, which can improve the interpretability and comprehensibility of outlier. Fourthly, the selection of N data objects with the greatest local outlier factor value is defined as contextual outliers. In the end, experimental results validate the effectiveness of the algorithm by using UCI data sets. (C) 2016 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据