4.7 Article

Optimizing matching time intervals for ride-hailing services using reinforcement learning

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.trc.2021.103239

关键词

Ride-hailing service; Online matching; Reinforcement learning; Policy gradient method

资金

  1. National Science Foundation [CMMI-1854684, CMMI-1904575]
  2. China Scholarship Council (CSC) [201806260144]

向作者/读者索取更多资源

Efficiently matching trip requests and available drivers in ride-hailing services is crucial, with a strategy of delaying matching to improve efficiency. The optimal delayed matching depends on the trade-off between delay penalty and reduced wait cost, and tailored reinforcement learning-based methods are used to overcome challenges. This work also addresses spatial partitioning balance and provides insights into delayed matching policies with real-world taxi trajectory data.
Matching trip requests and available drivers efficiently is considered a central operational problem for ride-hailing services. A widely adopted matching strategy is to accumulate a batch of potential passenger-driver matches and solve bipartite matching problems repeatedly. The efficiency of matching can be improved substantially if the matching is delayed by adaptively adjusting the matching time interval. The optimal delayed matching is subject to the trade-off between the delay penalty and the reduced wait cost and is dependent on the system's supply and demand states. Searching for the optimal delayed matching policy is challenging, as the current policy is compounded with past actions. To this end, we tailor a family of reinforcement learning-based methods to overcome the curse of dimensionality and sparse reward issues. In addition, this work provides a solution to spatial partitioning balance between the state representation error and the optimality gap of asynchronous matching. Lastly, we examine the proposed methods with real-world taxi trajectory data and garner managerial insights into the general delayed matching policies. The focus of this work is single-ride service due to limited access to shared ride data, while the general framework can be extended to the setting with a ridepooling component.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据