Projection of Turn Completion in Incremental Spoken Dialogue Systems

3Citations
Citations of this article
51Readers
Mendeley users who have this article in their library.

Abstract

The ability to take turns in a fluent way (i.e., without long response delays or frequent interruptions) is a fundamental aspect of any spoken dialog system. However, practical speech recognition services typically induce a long response delay, as it takes time before the processing of the user's utterance is complete. There is a considerable amount of research indicating that humans achieve fast response times by projecting what the interlocutor will say and estimating upcoming turn completions. In this work, we implement this mechanism in an incremental spoken dialog system, by using a language model that generates possible futures to project upcoming completion points. In theory, this could make the system more responsive, while still having access to semantic information not yet processed by the speech recognizer. We conduct a small study which indicates that this is a viable approach for practical dialog systems, and that this is a promising direction for future research.

Cite

CITATION STYLE

APA

Ekstedt, E., & Skantze, G. (2021). Projection of Turn Completion in Incremental Spoken Dialogue Systems. In SIGDIAL 2021 - 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue, Proceedings of the Conference (pp. 431–437). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.sigdial-1.45

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free