4.7 Article

Automatic Spoken Language Acquisition Based on Observation and Dialogue

期刊

出版社

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/JSTSP.2022.3189279

关键词

Vocabulary; Speech recognition; Grounding; Reinforcement learning; Uniform resource locators; Signal processing algorithms; Routing; Autonomous agent; reinforcement learning; self-supervised learning; spoken language acquisition; unsupervised learning

资金

  1. Toray Science Foundation
  2. JSPS KAKENHI [JP22K12069]

向作者/读者索取更多资源

Researchers propose spoken language acquisition agents that simulate the process of human language learning. By integrating multiple learning types, the agents successfully acquire spoken language from scratch and improve learning efficiency.
Human babies are born without knowledge of any specific language. They acquire language directly from observation and dialogue without being limited by the availability of labeled data. We propose spoken language acquisition agents that simulate the process. Such an ability requires multiple types of learning, including 1) word discovery, 2) symbol grounding, 3) message generation, and 4) pronunciation generation. Several studies have targeted one or combined learning types to elucidate human intelligence and aimed to equip spoken dialogue systems with human-like flexible language learning ability. However, their language ability was partially lacking some of the components. Our agents are the first to integrate them all. Our key concept is to design an architecture to integrate unsupervised, self-supervised, and reinforcement learning to utilize clues naturally existing in raw sensory signals and drive the learning based on the agent's intrinsic motivation. Experimental results show agents successfully acquire spoken language from scratch by interacting with an environment to act by speaking. Our proposed focusing mechanism significantly improves learning efficiency. We also demonstrate that our agents can learn neural vocoder and the concept of logical negation as a part of language acquisition.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据