Deep neural networks in Russian speech recognition

Nikita Markovnikov; Irina Kipyatkova; Alexey Karpov; Andrey Filchenkov

Conference Proceedings

Communications in Computer and Information Science (2018) 789 54-67

DOI: 10.1007/978-3-319-71746-3_5

Abstract

Hybrid speech recognition systems that combine deep neural networks (DNNs) with Hidden Markov Models/Gaussian Mixture Models have achieved good results. We propose applying various DNNs to automatic recognition of continuous Russian speech. We used several neural network models: Convolutional Neural Networks (CNNs), modifications of Long Short-Term Memory (LSTM), Residual Networks, and Recurrent Convolutional Networks (RCNNs). The presented model achieved a 7.5% reduction in word error rate (WER) compared with the Kaldi baseline. Experiments were performed on extra-large-vocabulary Russian speech (more than 30 h).
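The abstract reports its result as a reduction in WER. As a reminder of what that metric measures, here is a minimal sketch of the standard definition: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and a recognizer hypothesis, divided by the number of reference words. The function name `word_error_rate` and the example strings are illustrative assumptions, not taken from the paper or from Kaldi. The abstract does not state whether the 7.5% figure is an absolute or a relative reduction.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative helper (hypothetical name); whitespace tokenization is an
    assumption, and real scoring pipelines may normalize text differently.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# One substitution ("b" -> "x") and one deletion ("d") over four reference words.
print(word_error_rate("a b c d", "a x c"))  # 0.5
```

Because insertions count as errors, WER computed this way can exceed 1.0 when the hypothesis is much longer than the reference.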

Cite

Markovnikov, N., Kipyatkova, I., Karpov, A., & Filchenkov, A. (2018). Deep neural networks in Russian speech recognition. In Communications in Computer and Information Science (Vol. 789, pp. 54–67). Springer Verlag. https://doi.org/10.1007/978-3-319-71746-3_5
