Special speech synthesis for social network websites

Csaba Zainkó; Tamás Gábor Csapó; Géza Németh

Conference Proceedings

Special speech synthesis for social network websites

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2010) 6231 LNAI 455-463

DOI: 10.1007/978-3-642-15760-8_58

2Citations

10Readers

Get full text

Abstract

This paper gives an overview of the design concepts and implementation of a Hungarian microblog reading system. Speech synthesis of such special text requires some special components. First, an efficient diacritic reconstruction algorithm was applied. The accuracy of a former dictionary-based method was improved by machine learning to handle ambiguous cases properly. Second, an unlimited domain text-to-speech synthesizer was applied with extensions for emotional and spontaneous styles. Chat or blog texts often contain "emoticons" which mark the emotional state of the user. Therefore, an expressive speech synthesis method was adapted to a corpus-based synthesizer. Four emotions were generated and evaluated in a listening test: neutral, happy, angry and sad. The results of the experiments showed that happy and sad emotions can be generated with this algorithm, with best accuracy for female voice. © 2010 Springer-Verlag Berlin Heidelberg.

Author supplied keywords

Cite

CITATION STYLE

APA

Zainkó, C., Csapó, T. G., & Németh, G. (2010). Special speech synthesis for social network websites. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6231 LNAI, pp. 455–463). https://doi.org/10.1007/978-3-642-15760-8_58

Special speech synthesis for social network websites

Abstract

Author supplied keywords

Cite

Register to see more suggestions