Deep Feature Learning for Tibetan Speech Recognition using Sparse Auto-encoder

H. Wang; Y. Zhao; X.F. Liu; X.N. Xu; L. Wang; N. Zhou; Y.M. Xu

Conference ProceedingsOPEN ACCESS

Deep Feature Learning for Tibetan Speech Recognition using Sparse Auto-encoder

Wang H
Zhao Y
Liu X
et al.

Proceedings of the 2015 International Conference on Electrical, Automation and Mechanical Engineering (2015) 13

DOI: 10.2991/eame-15.2015.95

N/ACitations

6Readers

Abstract

HMM models based on MFCC features are widely used by researchers in Tibetan speech recognition. Although the shallow models of HMM are effective, they cannot reflect the speech perceptual mechanism in human being's brain. In this paper, we propose to apply sparse auto-encoder to learn deep features based on MFCC features for speech data. The deep features not only simulate sparse touches signal of the auditory nerve, and are significant to improve speech recognition accuracy with HMM models. Experimental results show that the deep features learned by sparse auto-encoder perform better on Tibetan speech recognition than MFCC features and the deep features learned by MLP.

Cite

CITATION STYLE

APA

Wang, H., Zhao, Y., Liu, X. F., Xu, X. N., Wang, L., Zhou, N., & Xu, Y. M. (2015). Deep Feature Learning for Tibetan Speech Recognition using Sparse Auto-encoder. In Proceedings of the 2015 International Conference on Electrical, Automation and Mechanical Engineering (Vol. 13). Atlantis Press. https://doi.org/10.2991/eame-15.2015.95

Deep Feature Learning for Tibetan Speech Recognition using Sparse Auto-encoder

Abstract

Cite

Register to see more suggestions