4.5 Article

Dynamic swarm class rebalancing for the process mining of rare events

期刊

JOURNAL OF SUPERCOMPUTING
卷 77, 期 7, 页码 7549-7583

出版社

SPRINGER
DOI: 10.1007/s11227-020-03500-x

关键词

Process mining; Class imbalance; Classification; Meta-heuristic; Over-sampling; Under-sampling

资金

  1. University of Macau [MYRG2016-00069-FST]
  2. FST
  3. RDAO

向作者/读者索取更多资源

Process mining is increasingly crucial in workflow model reconstructions, and the efficacy of this method relies on data mining algorithms accurately classifying future events from process logs. Our proposed methods address class imbalance by integrating swarm intelligence algorithms, aiming to enhance classification accuracy and confidence levels in process mining.
Process mining is becoming an indispensable method in workflow model reconstructions, offering insights into mission critical systems. The efficacy of process mining depends on whether the underlying data mining algorithms can accurately classify or predict future events from process logs. However, exceptional events are scarce in most operational processes. Hence, the process logs generated from these processes are highly imbalanced. It is quite often that any model learned from imbalanced data tends to be overly generalized toward the normal classes but under-trained to recognize the rare classes. In this paper, we propose 3 methods to rectify this class imbalance problem. They are founded upon a meta-heuristic-swarm intelligence algorithm. The first method, and also the base of the remaining 2 methods, is Dynamic Multi-objective Rebalancing Algorithm, which considers both high accuracy and high confidence level of classification in its objective function, and it is draw upon the particle swarm optimization algorithm. The other two algorithms are hybrid methods by combining the first base method with over-sampling and under-sampling techniques. Experiments are conducted using the three above-mentioned methods to process rebalanced dataset, as well as using other classic resampling methods for comparison. According to the results, our proposed methods show satisfactory performance over other comparison methods, and we extracted meaningful decision rules from a rebalanced dataset in process mining.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据