High-level Decisions from a Safe Maneuver Catalog with Reinforcement Learning for Safe and Cooperative Automated Merging

Reinforcement learning (RL) has recently been used for solving challenging decision-making problems in the context of automated driving. However, one of the main drawbacks of the presented RL-based policies is the lack of safety guarantees, since they strive to reduce the expected number of collisions but still tolerate them. In this paper, we propose an efficient RL-based decision-making pipeline for safe and cooperative automated driving in merging scenarios. The RL agent is able to predict the current situation and provide high-level decisions, specifying the operation mode of the low level planner which is responsible for safety. In order to learn a more generic policy, we propose a scalable RL architecture for the merging scenario that is not sensitive to changes in the environment configurations. According to our experiments, the proposed RL agent can efficiently identify cooperative drivers from their vehicle state history and generate interactive maneuvers, resulting in faster and more comfortable automated driving. At the same time, thanks to the safety constraints inside the planner, all of the maneuvers are collision free and safe.

[1]  Matthias Althoff,et al.  Online Verification of Automated Road Vehicles Using Reachability Analysis , 2014, IEEE Transactions on Robotics.

[2]  Mario Zanon,et al.  Real-Time Constrained Trajectory Planning and Vehicle Control for Proactive Autonomous Driving With Road Users , 2019, 2019 18th European Control Conference (ECC).

[3]  Helbing,et al.  Congested traffic states in empirical observations and microscopic simulations , 2000, Physical review. E, Statistical physics, plasmas, fluids, and related interdisciplinary topics.

[4]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[5]  Alex Graves,et al.  Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.

[6]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[7]  Christoph Stiller,et al.  A Belief State Planner for Interactive Merge Maneuvers in Congested Traffic , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[8]  Martin Lauer,et al.  Risk-Aware High-level Decisions for Automated Driving at Occluded Intersections with Reinforcement Learning , 2020, 2020 IEEE Intelligent Vehicles Symposium (IV).

[9]  Mykel J. Kochenderfer,et al.  Utility Decomposition with Deep Corrections for Scalable Planning under Uncertainty , 2018, AAMAS.

[10]  Peter Dayan,et al.  Q-learning , 1992, Machine Learning.

[11]  David Isele,et al.  Navigating Occluded Intersections with Autonomous Vehicles Using Deep Reinforcement Learning , 2018, 2018 IEEE International Conference on Robotics and Automation (ICRA).

[12]  Alexander J. Smola,et al.  Deep Sets , 2017, 1703.06114.

[13]  Jonas Sjöberg,et al.  Learning When to Drive in Intersections by Combining Reinforcement Learning and Model Predictive Control , 2019, 2019 IEEE Intelligent Transportation Systems Conference (ITSC).

[14]  Mykel J. Kochenderfer,et al.  Cooperation-Aware Reinforcement Learning for Merging in Dense Traffic , 2019, 2019 IEEE Intelligent Transportation Systems Conference (ITSC).

[15]  Amnon Shashua,et al.  Safe, Multi-Agent, Reinforcement Learning for Autonomous Driving , 2016, ArXiv.

[16]  Richard S. Sutton,et al.  Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.

[17]  Gabriel Kalweit,et al.  Dynamic Input for Deep Reinforcement Learning in Autonomous Driving , 2019, 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[18]  Johannes Müller,et al.  A Risk and Comfort Optimizing Motion Planning Scheme for Merging Scenarios* , 2019, 2019 IEEE Intelligent Transportation Systems Conference (ITSC).

[19]  Matthias Mayr,et al.  Lanelet2: A high-definition map framework for the future of automated driving , 2018, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[20]  David Silver,et al.  Deep Reinforcement Learning with Double Q-Learning , 2015, AAAI.