☆ 4.7 Article

Generative Adversarial Active Learning for Unsupervised Outlier Detection

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING (2020)

期刊

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING

卷 32, 期 8, 页码 1517-1528

出版社

IEEE COMPUTER SOC

DOI: 10.1109/TKDE.2019.2905606

关键词

Anomaly detection; Generators; Computational modeling; Data models; Training; Generative adversarial networks; Gallium nitride; Outlier detection; generate potential outliers; curse of dimensionality; generative adversarial active learning; mode collapsing problem; multiple-objective generative adversarial active learning

类别

Computer Science, Artificial Intelligence Computer Science, Information Systems Engineering, Electrical & Electronic

资金

Major Program of the National Natural Science Foundation of China [91846201, 71490725]
Foundation for Innovative Research Groups of the National Natural Science Foundation of China [71521001]
National Natural Science Foundation of China [71722010, 91546114, 91746302, 71872060]
National Key Research and Development Program of China [2017YFB0803303]
Project of Thousand Youth Talents 2018

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Outlier detection is an important topic in machine learning and has been used in a wide range of applications. In this paper, we approach outlier detection as a binary-classification issue by sampling potential outliers from a uniform reference distribution. However, due to the sparsity of data in high-dimensional space, a limited number of potential outliers may fail to provide sufficient information to assist the classifier in describing a boundary that can separate outliers from normal data effectively. To address this, we propose a novel Single-Objective Generative Adversarial Active Learning (SO-GAAL) method for outlier detection, which can directly generate informative potential outliers based on the mini-max game between a generator and a discriminator. Moreover, to prevent the generator from falling into the mode collapsing problem, the stop node of training should be determined when SO-GAAL is able to provide sufficient information. But without any prior information, it is extremely difficult for SO-GAAL. Therefore, we expand the network structure of SO-GAAL from a single generator to multiple generators with different objectives (MO-GAAL), which can generate a reasonable reference distribution for the whole dataset. We empirically compare the proposed approach with several state-of-the-art outlier detection methods on both synthetic and real-world datasets. The results show that MO-GAAL outperforms its competitors in the majority of cases, especially for datasets with various cluster types or high irrelevant variable ratio. The experiment codes are available at: https://github.com/leibinghe/GAAL-based-outlier-detection.

Generative Adversarial Active Learning for Unsupervised Outlier Detection

期刊

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING

出版社

IEEE COMPUTER SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Generative Adversarial Active Learning for Unsupervised Outlier Detection

期刊

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING

出版社

IEEE COMPUTER SOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文