4.5 Review

Turn-taking in Conversational Systems and Human-Robot Interaction: A Review

Journal

COMPUTER SPEECH AND LANGUAGE
Volume 67, Issue -, Pages -

Publisher

ACADEMIC PRESS LTD- ELSEVIER SCIENCE LTD
DOI: 10.1016/j.csl.2020.101178

Keywords

Turn-taking; Dialogue systems; Social robotics; Prosody; Gaze

Funding

  1. Swedish research council (VR) project Coordination of Attention and Turn -taking in Situated Interaction [2013-1403]

Ask authors/readers for more resources

Turn-taking is a fundamental aspect of dialogue, with humans being good at coordinating it while conversational systems often struggle with interruptions and delays. More research is needed to achieve fluent turn-taking in human-machine interaction.
The taking of turns is a fundamental aspect of dialogue. Since it is difficult to speak and listen at the same time, the participants need to coordinate who is currently speaking and when the next person can start to speak. Humans are very good at this coordination, and typically achieve fluent turn-taking with very small gaps and little overlap. Conversational systems (including voice assistants and social robots), on the other hand, typically have problems with frequent interruptions and long response delays, which has called for a substantial body of research on how to improve turn-taking in conversational systems. In this review article, we provide an overview of this research and give directions for future research. First, we provide a theoretical background of the linguistic research tradition on turn-taking and some of the fundamental concepts in theories of turn-taking. We also provide an extensive review of multi-modal cues (including verbal cues, prosody, breathing, gaze and gestures) that have been found to facilitate the coordination of turn-taking in human-human interaction, and which can be utilised for turn-taking in conversational systems. After this, we review work that has been done on modelling turn-taking, including end-of-turn detection, handling of user interruptions, generation of turn-taking cues, and multi-party human-robot interaction. Finally, we identify key areas where more research is needed to achieve fluent turn-taking in spoken interaction between man and machine. (C) 2020 The Author. Published by Elsevier Ltd.Y

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available