☆ 4.6 Article

Parallel exploration via negatively correlated search

FRONTIERS OF COMPUTER SCIENCE (2021)

期刊

FRONTIERS OF COMPUTER SCIENCE

卷 15, 期 5, 页码 -

出版社

HIGHER EDUCATION PRESS

DOI: 10.1007/s11704-020-0431-0

关键词

evolutionary computation; reinforcement learning; exploration

类别

Computer Science, Information Systems Computer Science, Software Engineering Computer Science, Theory & Methods

资金

Natural Science Foundation of China [61806090, 61672478]
Guangdong Provincial Key Laboratory [2020B121201001]
Program for Guangdong Introducing Innovative and Entrepreneurial Teams [2017ZT07X386]
Science and Technology Commission of Shanghai Municipality [19511120600]
Shenzhen Science and Technology Program [KQTD2016112514355531]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Effective exploration is crucial for a successful search process. The NCNES method presented in this paper shows the importance of coordinated parallel exploration and demonstrates significant advantages, especially in games with uncertain and delayed rewards.

Effective exploration is key to a successful search process. The recently proposed negatively correlated search (NCS) tries to achieve this by coordinated parallel exploration, where a set of search processes are driven to be negatively correlated so that different promising areas of the search space can be visited simultaneously. Despite successful applications of NCS, the negatively correlated search behaviors were mostly devised by intuition, while deeper (e.g., mathematical) understanding is missing. In this paper, a more principled NCS, namely NCNES, is presented, showing that the parallel exploration is equivalent to a process of seeking probabilistic models that both lead to solutions of high quality and are distant from previous obtained probabilistic models. Reinforcement learning, for which exploration is of particular importance, are considered for empirical assessment. The proposed NCNES is applied to directly train a deep convolution network with 1.7 million connection weights for playing Atari games. Empirical results show that the significant advantages of NCNES, especially on games with uncertain and delayed rewards, can be highly owed to the effective parallel exploration ability.

Parallel exploration via negatively correlated search

期刊

FRONTIERS OF COMPUTER SCIENCE

出版社

HIGHER EDUCATION PRESS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Parallel exploration via negatively correlated search

期刊

FRONTIERS OF COMPUTER SCIENCE

出版社

HIGHER EDUCATION PRESS

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文