☆ 4.3 Article

From internal models toward metacognitive AI

BIOLOGICAL CYBERNETICS (2021)

期刊

BIOLOGICAL CYBERNETICS

卷 115, 期 5, 页码 415-430

出版社

SPRINGER

DOI: 10.1007/s00422-021-00904-7

关键词

Internal models; Forward and inverse models; Cerebellum; Prefrontal cortex; Metacognition; Consciousness; Artificial intelligence; Hierarchical reinforcement learning

类别

Computer Science, Cybernetics Neurosciences

资金

Innovative Science and Technology Initiative for Security, Acquisition, Technology & Logistics Agency (ATLA), Japan [JP004596]
Japan Agency for Medical Research and Development (AMED) [JP21dm0307008]
ERATO, Japan Science and Technology Agency (JST) [JPMJER1801]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

Based on a review of literature from the past 20 years, a computational neuroscience model of metacognition is proposed, utilizing generative-inverse model pairs and a cognitive reality monitoring network (CRMN) in the prefrontal cortex. This model allows for the selection, learning, and monitoring of modules in perception, action, and reinforcement learning based on mismatches between generative and inverse models, as well as reward prediction errors.

In several papers published in Biological Cybernetics in the 1980s and 1990s, Kawato and colleagues proposed computational models explaining how internal models are acquired in the cerebellum. These models were later supported by neurophysiological experiments using monkeys and neuroimaging experiments involving humans. These early studies influenced neuroscience from basic, sensory-motor control to higher cognitive functions. One of the most perplexing enigmas related to internal models is to understand the neural mechanisms that enable animals to learn large-dimensional problems with so few trials. Consciousness and metacognition-the ability to monitor one's own thoughts, may be part of the solution to this enigma. Based on literature reviews of the past 20 years, here we propose a computational neuroscience model of metacognition. The model comprises a modular hierarchical reinforcement-learning architecture of parallel and layered, generative-inverse model pairs. In the prefrontal cortex, a distributed executive network called the cognitive reality monitoring network (CRMN) orchestrates conscious involvement of generative-inverse model pairs in perception and action. Based on mismatches between computations by generative and inverse models, as well as reward prediction errors, CRMN computes a responsibility signal that gates selection and learning of pairs in perception, action, and reinforcement learning. A high responsibility signal is given to the pairs that best capture the external world, that are competent in movements (small mismatch), and that are capable of reinforcement learning (small reward-prediction error). CRMN selects pairs with higher responsibility signals as objects of metacognition, and consciousness is determined by the entropy of responsibility signals across all pairs. This model could lead to new-generation AI, which exhibits metacognition, consciousness, dimension reduction, selection of modules and corresponding representations, and learning from small samples. It may also lead to the development of a new scientific paradigm that enables the causal study of consciousness by combining CRMN and decoded neurofeedback.

From internal models toward metacognitive AI

期刊

BIOLOGICAL CYBERNETICS

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

From internal models toward metacognitive AI

期刊

BIOLOGICAL CYBERNETICS

出版社

SPRINGER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文