4.3 Article

From internal models toward metacognitive AI

期刊

BIOLOGICAL CYBERNETICS
卷 115, 期 5, 页码 415-430

出版社

SPRINGER
DOI: 10.1007/s00422-021-00904-7

关键词

Internal models; Forward and inverse models; Cerebellum; Prefrontal cortex; Metacognition; Consciousness; Artificial intelligence; Hierarchical reinforcement learning

资金

  1. Innovative Science and Technology Initiative for Security, Acquisition, Technology & Logistics Agency (ATLA), Japan [JP004596]
  2. Japan Agency for Medical Research and Development (AMED) [JP21dm0307008]
  3. ERATO, Japan Science and Technology Agency (JST) [JPMJER1801]

向作者/读者索取更多资源

Based on a review of literature from the past 20 years, a computational neuroscience model of metacognition is proposed, utilizing generative-inverse model pairs and a cognitive reality monitoring network (CRMN) in the prefrontal cortex. This model allows for the selection, learning, and monitoring of modules in perception, action, and reinforcement learning based on mismatches between generative and inverse models, as well as reward prediction errors.
In several papers published in Biological Cybernetics in the 1980s and 1990s, Kawato and colleagues proposed computational models explaining how internal models are acquired in the cerebellum. These models were later supported by neurophysiological experiments using monkeys and neuroimaging experiments involving humans. These early studies influenced neuroscience from basic, sensory-motor control to higher cognitive functions. One of the most perplexing enigmas related to internal models is to understand the neural mechanisms that enable animals to learn large-dimensional problems with so few trials. Consciousness and metacognition-the ability to monitor one's own thoughts, may be part of the solution to this enigma. Based on literature reviews of the past 20 years, here we propose a computational neuroscience model of metacognition. The model comprises a modular hierarchical reinforcement-learning architecture of parallel and layered, generative-inverse model pairs. In the prefrontal cortex, a distributed executive network called the cognitive reality monitoring network (CRMN) orchestrates conscious involvement of generative-inverse model pairs in perception and action. Based on mismatches between computations by generative and inverse models, as well as reward prediction errors, CRMN computes a responsibility signal that gates selection and learning of pairs in perception, action, and reinforcement learning. A high responsibility signal is given to the pairs that best capture the external world, that are competent in movements (small mismatch), and that are capable of reinforcement learning (small reward-prediction error). CRMN selects pairs with higher responsibility signals as objects of metacognition, and consciousness is determined by the entropy of responsibility signals across all pairs. This model could lead to new-generation AI, which exhibits metacognition, consciousness, dimension reduction, selection of modules and corresponding representations, and learning from small samples. It may also lead to the development of a new scientific paradigm that enables the causal study of consciousness by combining CRMN and decoded neurofeedback.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.3
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据