Segmentation of telephone speech based on speech and non-speech models

Michael Heck; Christian Mohr; Sebastian Stüker; Markus Müller; Kevin Kilgour; Jonas Gehring; Quoc Bao Nguyen; Van Huy Nguyen; Alex Waibel

Conference Proceedings

Segmentation of telephone speech based on speech and non-speech models

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2013) 8113 LNAI 286-293

DOI: 10.1007/978-3-319-01931-4_38

4Citations

5Readers

Get full text

Abstract

In this paper we investigate the automatic segmentation of recorded telephone conversations based on models for speech and non-speech to find sentence-like chunks for use in speech recognition systems. Presented are two different approaches, based on Gaussian Mixture Models (GMMs) and Support Vector Machines (SVMs), respectively. The proposed methods provide segmentations that allow for competitive speech recognition performance in terms of word error rate (WER) compared to manual segmentation. © 2013 Springer International Publishing.

Author supplied keywords

Cite

CITATION STYLE

APA

Heck, M., Mohr, C., Stüker, S., Müller, M., Kilgour, K., Gehring, J., … Waibel, A. (2013). Segmentation of telephone speech based on speech and non-speech models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8113 LNAI, pp. 286–293). https://doi.org/10.1007/978-3-319-01931-4_38

Segmentation of telephone speech based on speech and non-speech models

Abstract

Author supplied keywords

Cite

Register to see more suggestions