☆ 4.7 Article

Deep Reinforcement Learning: A Survey

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2022)

Journal

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

Volume -, Issue -, Pages -

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TNNLS.2022.3207346

Keywords

Task analysis; Mathematical models; Deep learning; Trajectory; Behavioral sciences; Q-learning; Dynamic programming; Deep learning; deep reinforcement learning (DRL); imitation learning; maximum entropy deep reinforcement learning (RL); policy gradient; value function

Funding

National Key Research and Development Program of China [2018YFC0807500]
National Natural Science Foundations of China [61772396, 61772392, 61902296, 61825305]
Xi'an Key Laboratory of Big Data and Intelligent Vision [201805053ZD4CG37]
National Natural Science Foundation of Shaanxi Province [2020JQ-330, 2020JM-195]
China Postdoctoral Science Foundation [2019M663640]
Guangxi Key Laboratory of Trusted Software [KX202061]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

Deep reinforcement learning (DRL) is a technology that combines deep learning with reinforcement learning, achieving significant progress in tasks that involve perceiving high-dimensional input and making optimal decisions. However, there are still challenges in both theory and applications, especially in scenarios with limited samples, sparse rewards, and multiple agents. Researchers have proposed solutions and new theories to advance DRL and stimulate the development of subfields in reinforcement learning.

Deep reinforcement learning (DRL) integrates the feature representation ability of deep learning with the decision-making ability of reinforcement learning so that it can achieve powerful end-to-end learning control capabilities. In the past decade, DRL has made substantial advances in many tasks that require perceiving high-dimensional input and making optimal or near-optimal decisions. However, there are still many challenging problems in the theory and applications of DRL, especially in learning control tasks with limited samples, sparse rewards, and multiple agents. Researchers have proposed various solutions and new theories to solve these problems and promote the development of DRL. In addition, deep learning has stimulated the further development of many subfields of reinforcement learning, such as hierarchical reinforcement learning (HRL), multiagent reinforcement learning, and imitation learning. This article gives a comprehensive overview of the fundamental theories, key algorithms, and primary research domains of DRL. In addition to value-based and policy-based DRL algorithms, the advances in maximum entropy-based DRL are summarized. The future research topics of DRL are also analyzed and discussed.

Deep Reinforcement Learning: A Survey

Journal

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Deep Reinforcement Learning: A Survey

Journal

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper