Article

Exploration, novelty, surprise, and free energy minimization

Journal

FRONTIERS IN PSYCHOLOGY
Volume 4, Issue -, Pages -

Publisher

FRONTIERS MEDIA SA
DOI: 10.3389/fpsyg.2013.00710

Keywords

active inference; exploration; exploitation; novelty; reinforcement learning; free energy

Funding

  1. Wellcome Trust [091593, 088130, 098362] Funding Source: Medline

This paper reviews recent developments under the free energy principle that introduce a normative perspective on classical economic (utilitarian) decision-making based on (active) Bayesian inference. It has been suggested that the free energy principle precludes novelty and complexity, because it assumes that biological systems, like ourselves, try to minimize the long-term average of surprise to maintain their homeostasis. However, recent formulations show that minimizing surprise leads naturally to concepts such as exploration and novelty bonuses. In this approach, agents infer a policy that minimizes surprise by minimizing the difference (or relative entropy) between likely and desired outcomes, which involves both pursuing the goal-state that has the highest expected utility (often termed exploitation) and visiting a number of different goal-states (exploration). Crucially, the opportunity to visit new states increases the value of the current state. Casting decision-making problems within a variational framework, therefore, predicts that our behavior is governed by both the entropy and expected utility of future states. This dissolves any dialectic between minimizing surprise and exploration or novelty seeking.
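
The claim that behavior is governed by both entropy and expected utility can be illustrated with a small numerical sketch. It is not taken from the paper. It assumes, purely for illustration, that desired outcomes are encoded as a softmax over hypothetical utilities, so that log-probability of an outcome under the desired distribution plays the role of utility. Under that assumption, the relative entropy between likely outcomes p and desired outcomes q splits as KL[p || q] = -H[p] - E_p[log q], so lowering it favors both higher entropy (exploration) and higher expected utility (exploitation). All function names, utilities, and policies below are made up.

```python
import math


def entropy(p):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(pi * math.log(pi) for pi in p if pi > 0)


def expectation(p, values):
    """Expected value of `values` under distribution p."""
    return sum(pi * vi for pi, vi in zip(p, values))


def kl_divergence(p, q):
    """Relative entropy KL[p || q] between likely (p) and desired (q) outcomes."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def softmax(values):
    """Map utilities to a distribution (an assumption of this sketch)."""
    m = max(values)
    weights = [math.exp(v - m) for v in values]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical utilities for three outcome states: two good goal-states, one bad.
utility = [1.0, 0.9, -2.0]
desired = softmax(utility)
log_desired = [math.log(q) for q in desired]

# Hypothetical policies, each described by the outcomes it is likely to produce.
policies = {
    "exploit_only": [1.0, 0.0, 0.0],  # always visit the single best state
    "spread": [0.5, 0.5, 0.0],        # visit both good goal-states
}

scores = {}
for name, likely in policies.items():
    scores[name] = {
        "kl": kl_divergence(likely, desired),
        "entropy": entropy(likely),
        "expected_utility": expectation(likely, utility),
        # Same quantity as "kl", written as entropy plus expected log-preference.
        "decomposed": -entropy(likely) - expectation(likely, log_desired),
    }
    print(name, scores[name])
```

In this toy setting the policy that always visits the single best state has the higher expected utility, but the policy that spreads over both good goal-states has the lower relative entropy, because its entropy term outweighs the small loss in utility.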

