4.5 Article

A task allocation algorithm based on reinforcement learning in spatio-temporal crowdsourcing

期刊

APPLIED INTELLIGENCE
卷 53, 期 11, 页码 13452-13469

出版社

SPRINGER
DOI: 10.1007/s10489-022-04151-6

关键词

Task allocation; Spatio-temporal crowdsourcing; Markov model; Q-learning; Policy gradient

向作者/读者索取更多资源

With the widespread use of dynamic task allocation in sharing economy applications, online bipartite graph matching has become a focus of research. This paper proposes a dynamic delay bipartite matching (DDBM) problem and designs two task allocation frameworks to increase allocation utility.
With the pervasiveness of dynamic task allocation in sharing economy applications, online bipartite graph matching has attracted more and more research attention. In sharing economy applications, crowdsourcing platforms need to allocate tasks to workers dynamically. Previous studies have low allocation utility. To increase the allocation utility of the Spatio-temporal crowdsourcing system, this paper proposes a dynamic delay bipartite matching(DDBM) problem, and designs Value Based Task Allocation(VBTA) and Policy Gradient Based Task Allocation(PGTA) frameworks respectively. According to the current state, VBTA and PGTA could enhance the allocation utility by selecting appropriate thresholds. The convergence of the algorithm is proved. Extensive experimental results on two real datasets demonstrate that the proposed algorithms are superior to the existing algorithms in effectiveness and efficiency.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据