An open source speech synthesis frontend for HTS

Markus Toman; Michael Pucher

Conference Proceedings

An open source speech synthesis frontend for HTS

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9302 291-298

DOI: 10.1007/978-3-319-24033-6_33

1Citations

4Readers

Get full text

Abstract

This paper describes a software framework for HMM-based speech synthesis that we have developed and released to the public. The framework is compatible to the well-known HTS toolkit by incorporating Hts_engine and Flite. It enables HTS voices to be used as Microsoft Windows system voices and to be integrated into Android and iOS apps. Non- English languages are supported through the capability to load Festival format pronunciation dictionaries and letter to sound rules. The release also includes an Austrian German voice model of a male, professional speaker recorded in studio quality as well as pronunciation dictionary, letter to sound rules and basic text preprocessing procedures for Austrian German. The framework is available under an MIT-style license.

Author supplied keywords

Cite

CITATION STYLE

APA

Toman, M., & Pucher, M. (2015). An open source speech synthesis frontend for HTS. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9302, pp. 291–298). Springer Verlag. https://doi.org/10.1007/978-3-319-24033-6_33

An open source speech synthesis frontend for HTS

Abstract

Author supplied keywords

Cite

Register to see more suggestions