Optimizing phonetic encoding for viennese unit selection speech synthesis

Michael Pucher; Friedrich Neubarth; Volker Strom

Conference Proceedings

Optimizing phonetic encoding for viennese unit selection speech synthesis

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 5967 LNCS 207-216

DOI: 10.1007/978-3-642-12397-9_17

1Citations

3Readers

Get full text

Abstract

While developing lexical resources for a particular language variety (Viennese), we experimented with a set of 5 different phonetic encodings, termed phone sets, used for unit selection speech synthesis. We started with a very rich phone set based on phonological considerations and covering as much phonetic variability as possible, which was then reduced to smaller sets by applying transformation rules that map or merge phone symbols. The optimal trade-off was found measuring the phone error rates of automatically learnt grapheme-to-phone rules and by a perceptual evaluation of 27 representative synthesized sentences. Further, we describe a method to semi-automatically enlarge the lexical resources for the target language variety using a lexicon base for Standard Austrian German. © 2010 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Pucher, M., Neubarth, F., & Strom, V. (2010). Optimizing phonetic encoding for viennese unit selection speech synthesis. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5967 LNCS, pp. 207–216). Springer Verlag. https://doi.org/10.1007/978-3-642-12397-9_17

Optimizing phonetic encoding for viennese unit selection speech synthesis

Abstract

Author supplied keywords

Cite

Register to see more suggestions