4.7 Article

Reward design for driver repositioning using multi-agent reinforcement learning

Publisher

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.trc.2020.102738

Keywords

Mean field multi-agent reinforcement learning; Reward design; Bayesian optimization

Funding

  1. Region 2 University Transportation Research Center (UTRC)
  2. National Science Foundation under CAREER award [CMMI-1943998]


A large portion of passenger requests is reportedly unserviced, partly because vacant for-hire drivers cruise while seeking passengers. This paper models the multi-driver repositioning task through a mean field multi-agent reinforcement learning (MARL) approach that captures competition among multiple agents. Because directly applying MARL to the multi-driver system under a given reward mechanism will likely yield a suboptimal equilibrium due to the selfishness of drivers, this study proposes a reward design scheme with which a more desirable equilibrium can be reached. To solve the resulting bilevel optimization problem efficiently, with reward design as the upper level and the multi-agent system as the lower level, a Bayesian optimization (BO) algorithm is adopted to speed up the learning process. The bilevel optimization model is then applied to two case studies: e-hailing driver repositioning under a service charge, and multiclass taxi driver repositioning under NYC congestion pricing. In the first case study, the model is validated by the agreement between the optimal control derived from BO and that from an analytical solution. With a simple piecewise linear service charge, the objective of the e-hailing platform can be increased by 8.4%. In the second case study, BO finds an optimal toll charge of $5.1, which improves the objective of city planners by 7.9% compared with no toll charge. Under this optimal toll charge, the number of taxis in the NYC central business district decreases, indicating better traffic conditions, without substantially increasing the crowdedness of the subway system.
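The bilevel structure described in the abstract can be illustrated with a minimal sketch. It is not the paper's model: the lower level here is a hypothetical two-zone logit equilibrium solved by bisection (a stand-in for the mean field MARL system), the upper-level objective and all parameter values are invented for illustration, and the Bayesian optimization is a hand-rolled Gaussian process with an upper-confidence-bound rule rather than whatever BO variant the authors used.

```python
import numpy as np

def equilibrium_cbd_share(toll, fare_cbd=10.0, fare_out=6.0,
                          demand_cbd=0.6, demand_out=0.3, beta=1.0):
    """Lower level (toy): share of drivers choosing the CBD at a logit equilibrium."""
    def gap(x):
        # Expected earnings fall as more drivers compete for the same demand.
        u_cbd = fare_cbd * min(1.0, demand_cbd / x) - toll
        u_out = fare_out * min(1.0, demand_out / (1.0 - x))
        return x - 1.0 / (1.0 + np.exp(-beta * (u_cbd - u_out)))
    lo, hi = 1e-6, 1.0 - 1e-6
    for _ in range(60):  # gap() is increasing in x, so bisection finds the fixed point
        mid = 0.5 * (lo + hi)
        if gap(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

def planner_objective(toll, demand_cbd=0.6, demand_out=0.3, congestion_weight=0.3):
    """Upper level (toy): served demand minus a congestion penalty on CBD drivers."""
    x = equilibrium_cbd_share(toll)
    served = min(x, demand_cbd) + min(1.0 - x, demand_out)
    return served - congestion_weight * x

def rbf(a, b, length=1.5):
    """Squared-exponential kernel between two 1-D arrays of points."""
    return np.exp(-0.5 * ((a[:, None] - b[None, :]) / length) ** 2)

def gp_posterior(x_obs, y_obs, x_new, noise=1e-4):
    """Posterior mean and standard deviation of a zero-mean, unit-variance GP."""
    K = rbf(x_obs, x_obs) + noise * np.eye(len(x_obs))
    L = np.linalg.cholesky(K)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y_obs))
    k_star = rbf(x_obs, x_new)
    mean = k_star.T @ alpha
    v = np.linalg.solve(L, k_star)
    var = np.maximum(1.0 - np.sum(v ** 2, axis=0), 0.0)
    return mean, np.sqrt(var)

def bayes_opt(objective, lo=0.0, hi=10.0, n_init=4, n_iter=12, kappa=2.0):
    """Maximize a scalar objective over [lo, hi] with a GP surrogate and UCB."""
    xs = list(np.linspace(lo, hi, n_init))       # initial design includes toll = lo
    ys = [objective(x) for x in xs]
    grid = np.linspace(lo, hi, 201)              # candidate tolls
    for _ in range(n_iter):
        y = np.array(ys)
        y_std = (y - y.mean()) / (y.std() + 1e-9)  # standardize for the unit-variance GP
        mean, std = gp_posterior(np.array(xs), y_std, grid)
        x_next = float(grid[np.argmax(mean + kappa * std)])
        xs.append(x_next)
        ys.append(objective(x_next))             # each evaluation re-solves the lower level
    best = int(np.argmax(ys))
    return float(xs[best]), float(ys[best])

if __name__ == "__main__":
    best_toll, best_value = bayes_opt(planner_objective)
    print("toll:", best_toll, "objective:", best_value,
          "no-toll objective:", planner_objective(0.0))
```

The point of the sketch is the division of labor: every upper-level evaluation requires solving the lower-level equilibrium, which is expensive in the real MARL setting, so a surrogate-based search such as BO is used to limit the number of evaluations.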


Recommended

Article Transportation Science & Technology

3-Strategy evolutionary game model for operation extensions of subway networks

Yue Zhao, Liujiang Kang, Huijun Sun, Jianjun Wu, Nsabimana Buhigiro

Summary: This study proposes a 2-population 3-strategy evolutionary game model to address the issue of subway network operation extension. The analysis reveals that the rule of maximum total fitness ensures the priority of evolutionary equilibrium strategies, and proper adjustment minutes can enhance the effectiveness of operation extension.

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES (2024)

Article Transportation Science & Technology

Integrated optimization of container allocation and yard cranes dispatched under delayed transshipment

Hongtao Hu, Jiao Mo, Lu Zhen

Summary: This study investigates the challenges of daily storage yard management in marine container terminals considering delayed transshipment of containers. A mixed-integer linear programming model is proposed to minimize various costs associated with transportation and yard management. The improved Benders decomposition algorithm is applied to solve the problem effectively and efficiently.

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES (2024)

Article Transportation Science & Technology

Range-constrained traffic assignment for electric vehicles under heterogeneous range anxiety

Zhandong Xu, Yiyang Peng, Guoyuan Li, Anthony Chen, Xiaobo Liu

Summary: This paper studied the impact of range anxiety among electric vehicle drivers on traffic assignment. Two types of range-constrained traffic assignment problems were defined based on discrete or continuous distributed range anxiety. Models and algorithms were proposed to solve the two types of problems. Experimental results showed the superiority of the proposed algorithm and revealed that drivers with heightened range anxiety may cause severe congestion.

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES (2024)

Article Transportation Science & Technology

Demand forecasting and predictability identification of ride-sourcing via bidirectional spatial-temporal transformer neural processes

Chuanjia Li, Maosi Geng, Yong Chen, Zeen Cai, Zheng Zhu, Xiqun (Michael) Chen

Summary: Understanding spatial-temporal stochasticity in shared mobility is crucial, and this study introduces the Bi-STTNP prediction model that provides probabilistic predictions and uncertainty estimations for ride-sourcing demand, outperforming conventional deep learning methods. The model captures the multivariate spatial-temporal Gaussian distribution of demand and offers comprehensive uncertainty representations.

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES (2024)

Article Transportation Science & Technology

Partial trajectory method to align and validate successive video cameras for vehicle tracking

Benjamin Coifman, Lizhe Li

Summary: This paper develops a partial trajectory method for aligning views from successive fixed cameras in order to ensure high fidelity with the actual vehicle movements. The method operates on the output of vehicle tracking to provide direct feedback and improve alignment quality. Experimental results show that this method can enhance accuracy and increase the number of vehicles in the dataset.

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES (2024)

Article Transportation Science & Technology

Dynamic routing for the Electric Vehicle Shortest Path Problem with charging station occupancy information

Mohsen Dastpak, Fausto Errico, Ola Jabali, Federico Malucelli

Summary: This article discusses the problem of an Electric Vehicle (EV) finding the shortest route from an origin to a destination and proposes a problem model that considers the occupancy indicator information of charging stations. A Markov Decision Process formulation is presented to optimize the EV routing and charging policy. A reoptimization algorithm is developed to establish the sequence of charging station visits and charging amounts based on system updates. Results from a comprehensive computational study show that the proposed method significantly reduces waiting times and total trip duration.

TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES (2024)