Automatic emphasis labeling for emotional speech by measuring prosody generation error

2Citations
Citations of this article
3Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Emotion helps human to express their feelings and intentions clearly. And the emphasis labels of speeches are the key of speech emotion analysis and synthesis. In order to label the emotion emphasis of speech samples from a corpus with only phonetic and prosodic information, this paper introduces an automatic labeling algorithm by measuring the prosody generation error (PGE) of the result from a statistical synthesizer. Classification and Regression Tree (CART) and Maximum Entropy (ME) modeling are adopted for automatically labeling. Experiment shows that both models are helpful for labeling. © 2009 Springer Berlin Heidelberg.

Cite

CITATION STYLE

APA

Xu, J., & Cai, L. H. (2009). Automatic emphasis labeling for emotional speech by measuring prosody generation error. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5754 LNCS, pp. 177–186). https://doi.org/10.1007/978-3-642-04070-2_20

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free