4.7 Article

Generative Adversarial Active Learning for Unsupervised Outlier Detection

期刊

出版社

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2019.2905606

关键词

Anomaly detection; Generators; Computational modeling; Data models; Training; Generative adversarial networks; Gallium nitride; Outlier detection; generate potential outliers; curse of dimensionality; generative adversarial active learning; mode collapsing problem; multiple-objective generative adversarial active learning

资金

  1. Major Program of the National Natural Science Foundation of China [91846201, 71490725]
  2. Foundation for Innovative Research Groups of the National Natural Science Foundation of China [71521001]
  3. National Natural Science Foundation of China [71722010, 91546114, 91746302, 71872060]
  4. National Key Research and Development Program of China [2017YFB0803303]
  5. Project of Thousand Youth Talents 2018

向作者/读者索取更多资源

Outlier detection is an important topic in machine learning and has been used in a wide range of applications. In this paper, we approach outlier detection as a binary-classification issue by sampling potential outliers from a uniform reference distribution. However, due to the sparsity of data in high-dimensional space, a limited number of potential outliers may fail to provide sufficient information to assist the classifier in describing a boundary that can separate outliers from normal data effectively. To address this, we propose a novel Single-Objective Generative Adversarial Active Learning (SO-GAAL) method for outlier detection, which can directly generate informative potential outliers based on the mini-max game between a generator and a discriminator. Moreover, to prevent the generator from falling into the mode collapsing problem, the stop node of training should be determined when SO-GAAL is able to provide sufficient information. But without any prior information, it is extremely difficult for SO-GAAL. Therefore, we expand the network structure of SO-GAAL from a single generator to multiple generators with different objectives (MO-GAAL), which can generate a reasonable reference distribution for the whole dataset. We empirically compare the proposed approach with several state-of-the-art outlier detection methods on both synthetic and real-world datasets. The results show that MO-GAAL outperforms its competitors in the majority of cases, especially for datasets with various cluster types or high irrelevant variable ratio. The experiment codes are available at: https://github.com/leibinghe/GAAL-based-outlier-detection.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据