4.6 Article

Parallel exploration via negatively correlated search

期刊

FRONTIERS OF COMPUTER SCIENCE
卷 15, 期 5, 页码 -

出版社

HIGHER EDUCATION PRESS
DOI: 10.1007/s11704-020-0431-0

关键词

evolutionary computation; reinforcement learning; exploration

资金

  1. Natural Science Foundation of China [61806090, 61672478]
  2. Guangdong Provincial Key Laboratory [2020B121201001]
  3. Program for Guangdong Introducing Innovative and Entrepreneurial Teams [2017ZT07X386]
  4. Science and Technology Commission of Shanghai Municipality [19511120600]
  5. Shenzhen Science and Technology Program [KQTD2016112514355531]

向作者/读者索取更多资源

Effective exploration is crucial for a successful search process. The NCNES method presented in this paper shows the importance of coordinated parallel exploration and demonstrates significant advantages, especially in games with uncertain and delayed rewards.
Effective exploration is key to a successful search process. The recently proposed negatively correlated search (NCS) tries to achieve this by coordinated parallel exploration, where a set of search processes are driven to be negatively correlated so that different promising areas of the search space can be visited simultaneously. Despite successful applications of NCS, the negatively correlated search behaviors were mostly devised by intuition, while deeper (e.g., mathematical) understanding is missing. In this paper, a more principled NCS, namely NCNES, is presented, showing that the parallel exploration is equivalent to a process of seeking probabilistic models that both lead to solutions of high quality and are distant from previous obtained probabilistic models. Reinforcement learning, for which exploration is of particular importance, are considered for empirical assessment. The proposed NCNES is applied to directly train a deep convolution network with 1.7 million connection weights for playing Atari games. Empirical results show that the significant advantages of NCNES, especially on games with uncertain and delayed rewards, can be highly owed to the effective parallel exploration ability.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据