Article

A precise ranking method for outlier detection

INFORMATION SCIENCES (2015)

Journal

INFORMATION SCIENCES

Volume 324, Issue -, Pages 88-107

Publisher

ELSEVIER SCIENCE INC

DOI: 10.1016/j.ins.2015.06.030

Keywords

Outlier detection; Observability factor; Random sampling; Nearest neighbors; Information entropy

Category

Computer Science, Information Systems

Funding

Basic Science Research Program through the National Research Foundation of Korea (NRF) - Ministry of Education, Science and Technology [2012R1A1A1012153]
National Research Foundation of Korea [2012R1A1A1012153] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

Abstract

Recent research studies on outlier detection have focused on examining the nearest neighbor structure of a data object to measure its outlierness degree. This leads to two weaknesses: the size of nearest neighborhood, which should be predetermined, greatly affects the final detection results, and the outlierness scores produced by existing methods are not sufficiently diverse to allow precise ranking of outliers. To overcome these problems, in this research paper, a novel outlier detection method involving an iterative random sampling procedure is proposed. The proposed method is inspired by the simple notion that outlying objects are less easily selected than inlying objects in blind random sampling, and therefore, more inlierness scores are given to selected objects. We develop a new measure called the observability factor (OF) by utilizing this idea. In order to offer a heuristic guideline to determine the best size of nearest neighborhood, we additionally propose using the entropy of OF scores. An intensive numerical evaluation based on various synthetic and real-world datasets shows the superiority and effectiveness of the proposed method. (C) 2015 Elsevier Inc. All rights reserved.
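The sampling idea in the abstract can be illustrated with a small sketch. This is not the paper's exact procedure, which the abstract does not specify. It assumes that, in each iteration, a random subset of seed objects is drawn and every object in the k-nearest neighborhood of a seed counts as "observed". The function names `observability_scores` and `score_entropy`, the parameters `k`, `n_iter`, `sample_size`, and `bins`, and the histogram-based entropy are all hypothetical choices made for illustration.

```python
import numpy as np

def observability_scores(X, k=5, n_iter=200, sample_size=None, seed=0):
    """Hypothetical sketch: fraction of iterations in which each point
    is 'observed', i.e. appears among the k nearest neighbors of at
    least one randomly sampled seed point (an assumed scheme)."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    if sample_size is None:
        sample_size = max(1, n // 10)  # assumed default, not from the paper

    # Pairwise squared Euclidean distances (O(n^2) memory, small data only).
    diff = X[:, None, :] - X[None, :, :]
    d2 = np.einsum("ijk,ijk->ij", diff, diff)
    np.fill_diagonal(d2, np.inf)  # a point is not its own neighbor
    knn = np.argsort(d2, axis=1)[:, :k]  # k nearest neighbors of each point

    counts = np.zeros(n)
    for _ in range(n_iter):
        seeds = rng.choice(n, size=sample_size, replace=False)
        observed = np.unique(knn[seeds].ravel())
        counts[observed] += 1  # each observed point gains one inlierness vote
    return counts / n_iter  # low score suggests an outlier

def score_entropy(scores, bins=10):
    """Shannon entropy of the binned score distribution (assumed form)."""
    hist, _ = np.histogram(scores, bins=bins, range=(0.0, 1.0))
    p = hist[hist > 0] / hist.sum()
    return float(-(p * np.log(p)).sum())

# Usage: one tight cluster plus a single distant point.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(size=(50, 2)), [[100.0, 100.0]]])
scores = observability_scores(X, k=5)
ranking = np.argsort(scores)  # most outlying candidates first
```

Under these assumptions, the distant point is never among any other point's five nearest neighbors, so it receives the lowest possible score. Comparing `score_entropy` across several candidate values of `k` is one way to mimic the entropy-based guideline the abstract mentions, although the paper's actual criterion may differ.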
