Challenges in speech processing of Slavic languages (case studies in speech recognition of Czech and Slovak)

Abstract

Slavic languages pose a major challenge for researchers working in speech technology. They exhibit a high degree of inflection, namely declension of nouns, pronouns and adjectives, and conjugation of verbs. This greatly increases the size of lexical inventories in these languages and significantly complicates the design of text-to-speech and, in particular, speech-to-text systems. In the paper, we demonstrate some of the typical features of the Slavic languages and show how they can be handled in the development of practical speech processing systems. We present the solutions we applied in the design of voice dictation and broadcast speech transcription systems developed for Czech. Furthermore, we demonstrate how these systems can be converted to another, similar Slavic language, in our case Slovak. All the presented systems operate in real time with very large vocabularies (350K words in Czech, 170K words in Slovak), and some of them have already been deployed in practice. © 2010 Springer-Verlag.

Cite

Nouza, J., Zdansky, J., Cerva, P., & Silovsky, J. (2010). Challenges in speech processing of Slavic languages (case studies in speech recognition of Czech and Slovak). In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5967 LNCS, pp. 225–241). Springer Verlag. https://doi.org/10.1007/978-3-642-12397-9_19
