Automatic segmentation of parasitic sounds in speech corpora for TTS synthesis

Jindřich Matoušek

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6231 LNAI 369-376

DOI: 10.1007/978-3-642-15760-8_47

Abstract

In this paper, automatic segmentation of parasitic speech sounds in speech corpora for text-to-speech (TTS) synthesis is presented. Besides the automatic detection of the presence of such sounds, automatic segmentation is an important step in their precise localisation in speech corpora. The main goal of this study is to find out whether the segmentation of these sounds is accurate enough to enable either cutting the sounds out of synthetic speech or modelling them explicitly during synthesis. An HMM-based classifier was employed to detect the parasitic sounds and, simultaneously, to find the boundaries between these sounds and the surrounding phones. The results show that the automatic segmentation of parasitic sounds is comparable in accuracy to the segmentation of other phones, which indicates that cutting out or explicitly using parasitic sounds should be possible. © 2010 Springer-Verlag Berlin Heidelberg.

Cite

Matoušek, J. (2010). Automatic segmentation of parasitic sounds in speech corpora for TTS synthesis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6231 LNAI, pp. 369–376). https://doi.org/10.1007/978-3-642-15760-8_47
