Cooperative Learning for Smart Charging of Shared Autonomous Vehicle Fleets
研究了共享自动驾驶电动车队如何通过协同多智能体强化学习,学习智能充电策略(何时何地充电),以应对不确定需求、长充电时间和时变电价,最大化利润和服务质量。
We study the operational problem of shared autonomous electric vehicles that cooperate in providing on-demand mobility services while maximizing fleet profit and service quality. Therefore, we model the fleet operator and vehicles as interactive agents enriched with advanced decision-making aids. Our focus is on learning smart charging policies (when and where to charge vehicles) in anticipation of uncertain future demands to accommodate long charging times, restricted charging infrastructure, and time-varying electricity prices. We propose a distributed approach and formulate the problem as a semi-Markov decision process to capture its stochastic and dynamic nature. We use cooperative multiagent reinforcement learning with reshaped reward functions. The effectiveness and scalability of the proposed model are upgraded through deep learning. A mean-field approximation deals with environment instabilities, and hierarchical learning distinguishes high-level and low-level decisions. We evaluate our model using various numerical examples based on real data from ShareNow in Berlin, Germany. We show that the policies learned using our decentralized and dynamic approach outperform central static charging strategies. Finally, we conduct a sensitivity analysis for different fleet characteristics to demonstrate the proposed model’s robustness and provide managerial insights into the impacts of strategic decisions on fleet performance and derived charging policies. Supplemental Material: The online appendix is available at https://doi.org/10.1287/trsc.2022.1187 .