4.7 Article

Shadow-Price DRL: A Framework for Online Scheduling of Shared Autonomous EVs Fleets

期刊

IEEE TRANSACTIONS ON SMART GRID
卷 13, 期 4, 页码 3106-3117

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TSG.2022.3155455

关键词

Costs; Electric vehicle charging; Wind power generation; Transportation; Schedules; Renewable energy sources; Neural networks; Electric vehicle; electric power and transportation network; charging scheduling; deep reinforcement learning; shadow prices

资金

  1. National Natural Science Foundation of China [52177113, U1766205]
  2. Science and Technology Program of SGCC [5400202099508A-0-0-00, TSG-01291-2021]

向作者/读者索取更多资源

This paper focuses on the online scheduling problem of shared autonomous electric vehicle (SAEV) fleets, including charging management, routing, and rebalancing strategies. A novel framework called shadow-price deep reinforcement learning (shadow-price DRL) is proposed to address the challenge of time-varying trip demands. The framework combines a rigorous PTN operation model and a data-driven model-free DRL-based algorithm. Case studies validate the effectiveness of the proposed method.
This paper studies the online scheduling of shared autonomous electric vehicle (SAEV) fleets. The study includes charging management, routing and rebalancing strategies for SAEVs to serve the trip demands in the coupled power and transportation network (PTN). It aims to minimize the total social cost of PTN. The difficulty lies in how to deal with time-varying trip demands with the time-coupled SAEV scheduling and PTN operation considered. To address this challenge, for the first time, we propose a novel framework named the shadow-price deep reinforcement learning (shadow-price DRL), which combines the rigorous PTN operation model and the data-driven model-free DRL-based algorithm. Within the shadow-price DRL, the policy neural network adaptively learns the system dynamics and imposes its actions on the online SAEV scheduling problem as the dynamic shadow prices. By doing so, the SAEV schedule will be determined in the online manner. In addition, the connection between the proposed framework and Lagrangian Relaxation method is discussed, which illustrates the principles and effectiveness of the proposed method. The case studies include the practical Xi'an city which verifies the effectiveness of the shadow-price DRL and illustrates its significant superiority over the safe DRL and model predictive control (MPC) based methods.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据