4.2 Article

A linear time method for the detection of collective and point anomalies

Journal

STATISTICAL ANALYSIS AND DATA MINING
Volume 15, Issue 4, Pages 494-508

Publisher

WILEY
DOI: 10.1002/sam.11586

Keywords

dynamic programming; epidemic changepoints; exoplanets; Numenta Anomaly Benchmark; outliers; robust statistics

Funding

  1. British Telecommunications plc
  2. Engineering and Physical Sciences Research Council [EP/K032208/1, EP/L015692/1, EP/N031938/1, EP/R014604/1]

Ask authors/readers for more resources

This article introduces a computationally efficient approach called CAPA for efficiently identifying anomalies in data sequences, particularly suitable for detecting collective anomalies. Empirical results demonstrate that CAPA has lower computational cost and higher accuracy, making it applicable in various fields.
The challenge of efficiently identifying anomalies in data sequences is an important statistical problem that now arises in many applications. Although there has been substantial work aimed at making statistical analyses robust to outliers, or point anomalies, there has been much less work on detecting anomalous segments, or collective anomalies, particularly in those settings where point anomalies might also occur. In this article, we introduce collective and point anomalies (CAPA), a computationally efficient approach that is suitable when collective anomalies are characterized by either a change in mean, variance, or both, and distinguishes them from point anomalies. Empirical results show that CAPA has close to linear computational cost as well as being more accurate at detecting and locating collective anomalies than other approaches. We demonstrate the utility of CAPA through its ability to detect exoplanets from light curve data from the Kepler telescope and its capacity to detect machine faults from temperature data.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.2
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available