☆ 4.5 Article

US-Rule: Discovering Utility-driven Sequential Rules

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA (2023)

期刊

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA

卷 17, 期 1, 页码 -

出版社

ASSOC COMPUTING MACHINERY

DOI: 10.1145/3532613

关键词

Data mining; pattern mining; sequential rule; utility mining

类别

Computer Science, Information Systems Computer Science, Software Engineering

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

In this article, a faster algorithm called US-Rule is proposed for efficiently mining high-utility sequential rules. It utilizes the rule estimated utility co-occurrence pruning strategy to avoid meaningless computations. Moreover, four tighter upper bounds and corresponding pruning strategies are designed to improve efficiency on dense and long sequence datasets. US-Rule also proposes the rule estimated utility recomputing pruning strategy to deal with sparse datasets. Experimental results demonstrate that US-Rule outperforms existing algorithms in terms of execution time, memory consumption, and scalability.

Utility-driven mining is an important task in data science and has many applications in real life. High-utility sequential pattern mining (HUSPM) is one kind of utility-driven mining. It aims at discovering all sequential patterns with high utility. However, the existing algorithms of HUSPM can not provide a relatively accurate probability to deal with some scenarios for prediction or recommendation. High-utility sequential rule mining (HUSRM) is proposed to discover all sequential rules with high utility and high confidence. There is only one algorithm proposed for HUSRM, which is not efficient enough. In this article, we propose a faster algorithm called US-Rule, to efficiently mine high-utility sequential rules. It utilizes the rule estimated utility co-occurrence pruning strategy (REUCP) to avoid meaningless computations. Moreover, to improve its efficiency on dense and long sequence datasets, four tighter upper bounds (LEEU, REEU, LERSU, and RERSU) and corresponding pruning strategies (LEEUP, REEUP, LERSUP, and RERSUP) are designed. US-Rule also proposes the rule estimated utility recomputing pruning strategy (REURP) to deal with sparse datasets. Finally, a large number of experiments on different datasets compared to the state-of-the-art algorithm demonstrate that US-Rule can achieve better performance in terms of execution time, memory consumption, and scalability.

US-Rule: Discovering Utility-driven Sequential Rules

期刊

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

US-Rule: Discovering Utility-driven Sequential Rules

期刊

ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA

出版社

ASSOC COMPUTING MACHINERY

关键词

类别

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文