Hybrid speech recognition systems incorporating deep neural networks (DNNs) with Hidden Markov Models/Gaussian Mixture Models have achieved good results. We propose applying various DNNs in automatic recognition of Russian continuous speech. We used different neural network models such as Convolutional Neural Networks (CNNs), modifications of Long short-term memory (LSTM), Residual Networks and Recurrent Convolutional Networks (RCNNs). The presented model achieved 7.5% reducing of word error rate (WER) compared with Kaldi baseline. Experiments are performed with extra-large vocabulary (more than 30 h) of Russian speech.
CITATION STYLE
Markovnikov, N., Kipyatkova, I., Karpov, A., & Filchenkov, A. (2018). Deep neural networks in Russian speech recognition. In Communications in Computer and Information Science (Vol. 789, pp. 54–67). Springer Verlag. https://doi.org/10.1007/978-3-319-71746-3_5
Mendeley helps you to discover research relevant for your work.