Using prosody for automatic sentence segmentation of multi-party meetings

Jáchym Kolář; Elizabeth Shriberg; Yang Liu

Conference Proceedings

Using prosody for automatic sentence segmentation of multi-party meetings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2006) 4188 LNCS 629-636

DOI: 10.1007/11846406_79

16Citations

17Readers

Get full text

Abstract

We explore the use of prosodic features beyond pauses, including duration, pitch, and energy features, for automatic sentence segmentation of ICSI meeting data. We examine two different approaches to boundary classification: score-level combination of independent language and prosodic models using HMMs, and feature-level combination of models using a boosting-based method (BoosTexter). We report classification results for reference word transcripts as well as for transcripts from a state-of-the-art automatic speech recognizer (ASR). We also compare results using the lexical model plus a pause-only prosody model, versus results using additional prosodic features. Results show that (1) information from pauses is important, including pause duration both at the boundary and at the previous and following word boundaries; (2) adding duration, pitch, and energy features yields significant improvement over pause alone; (3) the integrated boosting-based model performs better than the HMM for ASR conditions; (4) training the boosting-based model on recognized words yields further improvement. © Springer-Verlag Berlin Heidelberg 2006.

Cite

CITATION STYLE

APA

Kolář, J., Shriberg, E., & Liu, Y. (2006). Using prosody for automatic sentence segmentation of multi-party meetings. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 4188 LNCS, pp. 629–636). Springer Verlag. https://doi.org/10.1007/11846406_79

Using prosody for automatic sentence segmentation of multi-party meetings

Abstract

Cite

Register to see more suggestions