Segmentation of telephone speech based on speech and non-speech models

4Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

In this paper we investigate the automatic segmentation of recorded telephone conversations based on models for speech and non-speech to find sentence-like chunks for use in speech recognition systems. Presented are two different approaches, based on Gaussian Mixture Models (GMMs) and Support Vector Machines (SVMs), respectively. The proposed methods provide segmentations that allow for competitive speech recognition performance in terms of word error rate (WER) compared to manual segmentation. © 2013 Springer International Publishing.

Cite

CITATION STYLE

APA

Heck, M., Mohr, C., Stüker, S., Müller, M., Kilgour, K., Gehring, J., … Waibel, A. (2013). Segmentation of telephone speech based on speech and non-speech models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8113 LNAI, pp. 286–293). https://doi.org/10.1007/978-3-319-01931-4_38

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free