☆ 4.7 Article

Optimizing matching time intervals for ride-hailing services using reinforcement learning

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES (2021)

期刊

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES

卷 129, 期 -, 页码 -

出版社

PERGAMON-ELSEVIER SCIENCE LTD

DOI: 10.1016/j.trc.2021.103239

关键词

Ride-hailing service; Online matching; Reinforcement learning; Policy gradient method

类别

Transportation Science & Technology

资金

National Science Foundation [CMMI-1854684, CMMI-1904575]
China Scholarship Council (CSC) [201806260144]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Efficiently matching trip requests and available drivers in ride-hailing services is crucial, with a strategy of delaying matching to improve efficiency. The optimal delayed matching depends on the trade-off between delay penalty and reduced wait cost, and tailored reinforcement learning-based methods are used to overcome challenges. This work also addresses spatial partitioning balance and provides insights into delayed matching policies with real-world taxi trajectory data.

Matching trip requests and available drivers efficiently is considered a central operational problem for ride-hailing services. A widely adopted matching strategy is to accumulate a batch of potential passenger-driver matches and solve bipartite matching problems repeatedly. The efficiency of matching can be improved substantially if the matching is delayed by adaptively adjusting the matching time interval. The optimal delayed matching is subject to the trade-off between the delay penalty and the reduced wait cost and is dependent on the system's supply and demand states. Searching for the optimal delayed matching policy is challenging, as the current policy is compounded with past actions. To this end, we tailor a family of reinforcement learning-based methods to overcome the curse of dimensionality and sparse reward issues. In addition, this work provides a solution to spatial partitioning balance between the state representation error and the optimality gap of asynchronous matching. Lastly, we examine the proposed methods with real-world taxi trajectory data and garner managerial insights into the general delayed matching policies. The focus of this work is single-ride service due to limited access to shared ride data, while the general framework can be extended to the setting with a ridepooling component.

Optimizing matching time intervals for ride-hailing services using reinforcement learning

期刊

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Optimizing matching time intervals for ride-hailing services using reinforcement learning

期刊

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES

出版社

PERGAMON-ELSEVIER SCIENCE LTD

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文