Learning prosodic sequences using the fundamental frequency variation spectrum

8Citations
Citations of this article
16Readers
Mendeley users who have this article in their library.

Abstract

We investigate a recently introduced vector-valued representation of fundamental frequency variation, whose properties appear to be well-suited for statistical sequence modeling. We show what the representation looks like, and apply hidden Markov models to learn prosodic sequences characteristic of higher-level turn-taking phenomena. Our analysis shows that the models learn exactly those characteristics which have been reported for the phenomena in the literature. Further refinements to the representation lead to a 12-17% relative improvement in speaker change prediction for conversational spoken dialogue systems.

Cite

CITATION STYLE

APA

Laskowski, K., Edlund, J., & Heldner, M. (2008). Learning prosodic sequences using the fundamental frequency variation spectrum. In Proceedings of the 4th International Conference on Speech Prosody, SP 2008 (pp. 151–154). International Speech Communications Association. https://doi.org/10.21437/speechprosody.2008-36

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free