In this paper we investigate the automatic segmentation of recorded telephone conversations based on models for speech and non-speech to find sentence-like chunks for use in speech recognition systems. Presented are two different approaches, based on Gaussian Mixture Models (GMMs) and Support Vector Machines (SVMs), respectively. The proposed methods provide segmentations that allow for competitive speech recognition performance in terms of word error rate (WER) compared to manual segmentation. © 2013 Springer International Publishing.
CITATION STYLE
Heck, M., Mohr, C., Stüker, S., Müller, M., Kilgour, K., Gehring, J., … Waibel, A. (2013). Segmentation of telephone speech based on speech and non-speech models. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8113 LNAI, pp. 286–293). https://doi.org/10.1007/978-3-319-01931-4_38
Mendeley helps you to discover research relevant for your work.