Mongolian speech recognition based on deep neural networks

Hui Zhang; Feilong Bao; Guanglai Gao

Conference Proceedings

Mongolian speech recognition based on deep neural networks

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9427 180-188

DOI: 10.1007/978-3-319-25816-4_15

11Citations

5Readers

Get full text

Abstract

Mongolian is an influential language. And better Mongolian Large Vocabulary Continuous Speech Recognition (LVCSR) systems are required. Recently, the research of speech recognition has achieved a big improvement by introducing the Deep Neural Networks (DNNs). In this study, a DNN-based Mongolian LVCSR system is built. Experimental results show that the DNN-based models outperform the conventional models which based on Gaussian Mixture Models (GMMs) for the Mongolian speech recognition, by a large margin. Compared with the best GMM-based model, the DNN-based one obtains a relative improvement over 50%. And it becomes a new state-of-the-art system in this field.

Author supplied keywords

Cite

CITATION STYLE

APA

Zhang, H., Bao, F., & Gao, G. (2015). Mongolian speech recognition based on deep neural networks. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9427, pp. 180–188). Springer Verlag. https://doi.org/10.1007/978-3-319-25816-4_15

Mongolian speech recognition based on deep neural networks

Abstract

Author supplied keywords

Cite

Register to see more suggestions