End-to-end large vocabulary speech recognition for the serbian language

Branislav Popović; Edvin Pakoci; Darko Pekar

Conference Proceedings

End-to-end large vocabulary speech recognition for the serbian language

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10458 LNAI 343-352

DOI: 10.1007/978-3-319-66429-3_33

9Citations

2Readers

Get full text

Abstract

This paper presents the results of a large vocabulary speech recognition for the Serbian language, developed by using Eesen end-to-end framework. Eesen involves training a single deep recurrent neural network, containing a number of bidirectional long short-term memory layers, modeling the connection between the speech and a set of context-independent lexicon units. This approach reduces the amount of expert knowledge needed in order to develop other competitive speech recognition systems. The training is based on a connectionist temporal classification, while decoding allows the usage of weighted finite-state transducers. This provides much faster and more efficient decoding in comparison to other similar systems. A corpus of approximately 215 h of audio data (about 171 h of speech and 44 h of silence, or 243 male and 239 female speakers) was employed for the training (about 90%) and testing (about 10%) purposes. On a set of more than 120000 words, the word error rate of 14.68% and the character error rate of 3.68% is achieved.

Author supplied keywords

Cite

CITATION STYLE

APA

Popović, B., Pakoci, E., & Pekar, D. (2017). End-to-end large vocabulary speech recognition for the serbian language. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10458 LNAI, pp. 343–352). Springer Verlag. https://doi.org/10.1007/978-3-319-66429-3_33

End-to-end large vocabulary speech recognition for the serbian language

Abstract

Author supplied keywords

Cite

Register to see more suggestions