4.6 Article

Online sequential extreme studentized deviate tests for anomaly detection in streaming data with varying patterns

Publisher

SPRINGER
DOI: 10.1007/s10586-021-03236-0

Keywords

Anomaly detection; Generalized extreme studentized deviate (GESD) test; Time series; Streaming data

Funding

  1. Ministry of Education of the Republic of Korea
  2. National Research Foundation of Korea [NRF-2020R1F1A1076278]

In the new era of big data, the importance of real-time contextual anomaly detection is rapidly increasing. Many anomaly detection algorithms have weaknesses in dealing with streaming time-series data containing different patterns. This paper proposes an online contextual anomaly detection method that shows a clear advantage in analyzing streaming data with varying patterns.
In the new era of big data, numerous information and technology systems store huge amounts of streaming data in real time, for example, server-access logs on web application servers. The importance of detecting anomalies in the voluminous streaming data from such systems is rapidly increasing. One of the biggest challenges in this task is real-time contextual anomaly detection in streaming data with varying patterns that are visually detectable but unsuitable for a parametric model. Most anomaly detection algorithms have weaknesses in dealing with streaming time-series data containing such patterns. In this paper, we propose a novel method for online contextual anomaly detection in streaming time-series data using generalized extreme studentized deviate (GESD) tests. The GESD test is relatively accurate and efficient because it performs statistical hypothesis testing, but it is unable to handle streaming time-series data. Thus, focusing on streaming time-series data, we propose an online version of the test capable of detecting outliers under varying patterns. We perform extensive experiments with simulated data, syntactic data, and real online traffic data from Yahoo Webscope, showing a clear advantage of the proposed method, particularly for analyzing streaming data with varying patterns.
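
For orientation, the classical batch GESD test that the abstract builds on can be sketched as follows. This is only a minimal illustration of the standard test (Rosner's formulation), not the online method proposed in the paper, whose details are not given here. The function name `gesd_outliers`, its parameters, and the sample series are assumptions made for this example; the sketch relies on NumPy and SciPy.

```python
import numpy as np
from scipy import stats


def gesd_outliers(x, max_outliers, alpha=0.05):
    """Classical (batch) GESD test; returns sorted indices of flagged points.

    Hypothetical helper for illustration only, not the paper's online variant.
    """
    x = np.asarray(x, dtype=float)
    n = x.size
    # The t quantile below needs at least one degree of freedom.
    max_outliers = min(max_outliers, n - 2)
    remaining = np.arange(n)   # indices still in the sample
    removed = []               # indices removed so far, in removal order
    n_outliers = 0
    for i in range(1, max_outliers + 1):
        vals = x[remaining]
        sd = vals.std(ddof=1)
        if sd == 0:
            break              # all remaining values are identical
        dev = np.abs(vals - vals.mean())
        j = int(np.argmax(dev))
        r_i = dev[j] / sd      # extreme studentized deviate R_i
        # Critical value lambda_i for the i-th step.
        p = 1 - alpha / (2 * (n - i + 1))
        t = stats.t.ppf(p, df=n - i - 1)
        lam = (n - i) * t / np.sqrt((n - i - 1 + t**2) * (n - i + 1))
        removed.append(int(remaining[j]))
        remaining = np.delete(remaining, j)
        if r_i > lam:
            n_outliers = i     # keep the largest i with R_i > lambda_i
    return sorted(removed[:n_outliers])


# Made-up series with one obvious spike at position 9.
series = [10.0, 10.2, 9.9, 10.1, 9.8, 10.3, 10.0, 9.7, 10.2, 25.0, 10.1, 9.9]
print(gesd_outliers(series, max_outliers=3))  # prints [9]
```

Because the test needs the whole sample and an upper bound on the number of outliers up front, it cannot be applied directly to an unbounded stream, which is the limitation the abstract points to.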
