3.8 Proceedings Paper

The Analysis of Online Event Streams: Predicting the Next Activity for Anomaly Detection

期刊

RESEARCH CHALLENGES IN INFORMATION SCIENCE
卷 446, 期 -, 页码 248-264

出版社

SPRINGER INTERNATIONAL PUBLISHING AG
DOI: 10.1007/978-3-031-05760-1_15

关键词

Process mining; Event stream; Anomaly detection

向作者/读者索取更多资源

This paper investigates online event anomaly detection using next-activity prediction methods. The probabilities of next-activities are predicted using ML and deep models, and events predicted unlikely are considered as anomalies. The evaluation shows that the proposed ML model-based method tends to outperform the deep model-based method and classical unsupervised approaches in detecting anomalous events.
Anomaly detection in process mining focuses on identifying anomalous cases or events in process executions. The resulting diagnostics are used to provide measures to prevent fraudulent behavior, as well as to derive recommendations for improving process compliance and security. Most existing techniques focus on detecting anomalous cases in an offline setting. However, to identify potential anomalies in a timely manner and take immediate countermeasures, it is necessary to detect event-level anomalies online, in real-time. In this paper, we propose to tackle the online event anomaly detection problem using next-activity prediction methods. More specifically, we investigate the use of both ML models (such as RF and XGBoost) and deep models (such as LSTM) to predict the probabilities of next-activities and consider the events predicted unlikely as anomalies. We compare these predictive anomaly detection methods to four classical unsupervised anomaly detection approaches (such as Isolation forest and LOF) in the online setting. Our evaluation shows that the proposed method using ML models tends to outperform the one using a deep model, while both methods outperform the classical unsupervised approaches in detecting anomalous events.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

3.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据