4.7 Article

Connected automated vehicle cooperative control with a deep reinforcement learning approach in a mixed traffic environment

出版社

PERGAMON-ELSEVIER SCIENCE LTD
DOI: 10.1016/j.trc.2021.103421

关键词

Mixed connected automated traffic; environment; Cooperative control; Deep reinforcement learning; Traffic oscillation dampening; Energy efficiency

资金

  1. Wisconsin Traffic Operations and Safety (TOPS) Laboratory

向作者/读者索取更多资源

This paper presents a cooperative strategy for longitudinal control of vehicles in a mixed connected and automated traffic environment using a deep reinforcement learning algorithm. By decomposing mixed traffic into multiple subsystems and enabling CAVs to learn from leading HDVs, the proposed method improves performance locally for each subsystem and overall for the traffic flow.
This paper proposes a cooperative strategy of connected and automated vehicles (CAVs) longitudinal control for a mixed connected and automated traffic environment based on deep reinforcement learning (DRL) algorithm, which enhances the string stability of mixed traffic, car following efficiency, and energy efficiency. Since the sequences of mixed traffic are combinatory, to reduce the training dimension and alleviate communication burdens, we decomposed mixed traffic into multiple subsystems where each subsystem is comprised of human-driven vehicles (HDV) followed by cooperative CAVs. Based on that, a cooperative CAV control strategy is developed based on a deep reinforcement learning algorithm, enabling CAVs to learn the leading HDV's characteristics and make longitudinal control decisions cooperatively to improve the performance of each subsystem locally and consequently enhance performance for the whole mixed traffic flow. For training, a distributed proximal policy optimization is applied to ensure the training convergence of the proposed DRL. To verify the effectiveness of the proposed method, simulated experiments are conducted, which shows the performance of our proposed model has a great generalization capability of dampening oscillations, fulfilling the car following and energy-saving tasks efficiently under different penetration rates and various leading HDVs behaviors.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据