4.2 Article

Process mining: a two-step approach to balance between underfitting and overfitting

期刊

SOFTWARE AND SYSTEMS MODELING
卷 9, 期 1, 页码 87-111

出版社

SPRINGER HEIDELBERG
DOI: 10.1007/s10270-008-0106-z

关键词

-

向作者/读者索取更多资源

Process mining includes the automated discovery of processes from event logs. Based on observed events (e.g., activities being executed or messages being exchanged) a process model is constructed. One of the essential problems in process mining is that one cannot assume to have seen all possible behavior. At best, one has seen a representative subset. Therefore, classical synthesis techniques are not suitable as they aim at finding a model that is able to exactly reproduce the log. Existing process mining techniques try to avoid such overfitting by generalizing the model to allow for more behavior. This generalization is often driven by the representation language and very crude assumptions about completeness. As a result, parts of the model are overfitting (allow only for what has actually been observed) while other parts may be underfitting (allow for much more behavior without strong support for it). None of the existing techniques enables the user to control the balance between overfitting and underfitting. To address this, we propose a two-step approach. First, using a configurable approach, a transition system is constructed. Then, using the theory of regions, the model is synthesized. The approach has been implemented in the context of ProM and overcomes many of the limitations of traditional approaches.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据