Using conditional random fields to predict pitch accents in conversational speech

Abstract

The detection of prosodic characteristics is an important aspect of both speech synthesis and speech recognition. Correct placement of pitch accents aids in more natural-sounding speech, while automatic detection of accents can contribute to better word-level recognition and better textual understanding. In this paper we investigate probabilistic, contextual, and phonological factors that influence pitch accent placement in natural, conversational speech in a sequence labeling setting. We introduce Conditional Random Fields (CRFs) to the pitch accent prediction task in order to incorporate these factors efficiently in a sequence model. We demonstrate the usefulness and the incremental effect of these factors in a sequence model by performing experiments on hand-labeled data from the Switchboard Corpus. Our model outperforms the baseline and previous models of pitch accent prediction on the Switchboard Corpus.

Cite

Gregory, M. L., & Altun, Y. (2004). Using conditional random fields to predict pitch accents in conversational speech. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 677–683). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1218955.1219041
