Article

Dense reinforcement learning for safety validation of autonomous vehicles

NATURE (2023)

Journal

NATURE

Volume 615, Issue 7953, Pages 620-+

Publisher

NATURE PORTFOLIO

DOI: 10.1038/s41586-023-05732-2

Category

Multidisciplinary Sciences

Summary

A critical bottleneck for autonomous vehicle development and deployment is the high cost of validating safety in real-world driving. The researchers developed an intelligent testing environment that uses AI-based agents to accelerate safety validation without introducing bias. Their approach reduces testing time by orders of magnitude and can also be applied to other safety-critical autonomous systems.

Abstract

One critical bottleneck that impedes the development and deployment of autonomous vehicles is the prohibitively high economic and time costs required to validate their safety in a naturalistic driving environment, owing to the rarity of safety-critical events (1). Here we report the development of an intelligent testing environment, where artificial-intelligence-based background agents are trained to validate the safety performances of autonomous vehicles in an accelerated mode, without loss of unbiasedness. From naturalistic driving data, the background agents learn what adversarial manoeuvre to execute through a dense deep-reinforcement-learning (D2RL) approach, in which Markov decision processes are edited by removing non-safety-critical states and reconnecting critical ones so that the information in the training data is densified. D2RL enables neural networks to learn from densified information with safety-critical events and achieves tasks that are intractable for traditional deep-reinforcement-learning approaches. We demonstrate the effectiveness of our approach by testing a highly automated vehicle in both highway and urban test tracks with an augmented-reality environment, combining simulated background vehicles with physical road infrastructure and a real autonomous test vehicle. Our results show that the D2RL-trained agents can accelerate the evaluation process by multiple orders of magnitude (10^3 to 10^5 times faster). In addition, D2RL will enable accelerated testing and training with other safety-critical autonomous systems.
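
The abstract describes editing a Markov decision process by removing non-safety-critical states and reconnecting the critical ones. The sketch below illustrates that idea on a single recorded episode. It is not the authors' implementation: the `Step` record, the `critical` flag, the `densify` function, the `terminal_state` sentinel, and the choice to sum undiscounted rewards across the skipped steps are all assumptions made for illustration. How criticality is actually measured is not stated in the abstract.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Step:
    state: int       # hypothetical discrete state id
    action: int      # action taken in this state
    reward: float    # reward received after the action
    critical: bool   # hypothetical flag: is this state safety-critical?


def densify(episode: List[Step], terminal_state: int = -1) -> List[Tuple[int, int, float, int]]:
    """Keep only critical steps and link each one to the next critical step.

    Returns (state, action, accumulated_reward, next_state) tuples.
    Rewards of skipped non-critical steps are summed into the preceding
    critical transition (an assumption of this sketch). Steps before the
    first critical state are dropped entirely.
    """
    critical_idx = [i for i, step in enumerate(episode) if step.critical]
    transitions = []
    for k, i in enumerate(critical_idx):
        # Index of the next critical step, or end of episode if none remains.
        j = critical_idx[k + 1] if k + 1 < len(critical_idx) else len(episode)
        reward = sum(step.reward for step in episode[i:j])
        next_state = episode[j].state if j < len(episode) else terminal_state
        transitions.append((episode[i].state, episode[i].action, reward, next_state))
    return transitions


episode = [
    Step(state=0, action=0, reward=0.0, critical=False),
    Step(state=1, action=1, reward=0.0, critical=True),
    Step(state=2, action=0, reward=0.0, critical=False),
    Step(state=3, action=1, reward=0.0, critical=True),
    Step(state=4, action=0, reward=1.0, critical=False),
]
dense = densify(episode)
print(dense)
```

In this toy episode, five recorded steps collapse into two transitions, so every training sample handed to a learner involves a safety-critical decision. A real pipeline would need a principled criticality measure and a consistent treatment of discounting, neither of which this sketch attempts.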
