4.5 Article

Isolation-Based Anomaly Detection

Journal

Publisher

ASSOC COMPUTING MACHINERY
DOI: 10.1145/2133360.2133363

Keywords

Anomaly detection; outlier detection; ensemble methods; binary tree; random tree ensemble; isolation; isolation forest

Funding

  1. Australian Postgraduate Awards (APA)
  2. Information and Communications Technologies (ICT) Postgraduate Research Scholarships
  3. National Science Foundation of China [61073097, 61021062]
  4. National Fundamental Research Program of China [2010CB327903]
  5. Jiangsu Science Foundation [BK2008018]

Ask authors/readers for more resources

Anomalies are data points that are few and different. As a result of these properties, we show that, anomalies are susceptible to a mechanism called isolation. This article proposes a method called Isolation Forest (iForest), which detects anomalies purely based on the concept of isolation without employing any distance or density measure-fundamentally different from all existing methods. As a result, iForest is able to exploit subsampling (i) to achieve a low linear time-complexity and a small memory-requirement and (ii) to deal with the effects of swamping and masking effectively. Our empirical evaluation shows that iForest outperforms ORCA, one-class SVM, LOF and Random Forests in terms of AUC, processing time, and it is robust against masking and swamping effects. iForest also works well in high dimensional problems containing a large number of irrelevant attributes, and when anomalies are not available in training sample.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available