Deep neural networks in Russian speech recognition

8Citations
Citations of this article
24Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Hybrid speech recognition systems incorporating deep neural networks (DNNs) with Hidden Markov Models/Gaussian Mixture Models have achieved good results. We propose applying various DNNs in automatic recognition of Russian continuous speech. We used different neural network models such as Convolutional Neural Networks (CNNs), modifications of Long short-term memory (LSTM), Residual Networks and Recurrent Convolutional Networks (RCNNs). The presented model achieved 7.5% reducing of word error rate (WER) compared with Kaldi baseline. Experiments are performed with extra-large vocabulary (more than 30 h) of Russian speech.

Cite

CITATION STYLE

APA

Markovnikov, N., Kipyatkova, I., Karpov, A., & Filchenkov, A. (2018). Deep neural networks in Russian speech recognition. In Communications in Computer and Information Science (Vol. 789, pp. 54–67). Springer Verlag. https://doi.org/10.1007/978-3-319-71746-3_5

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free