Time-delay recurrent neural network for cross-lingual speech recognition

Xia Mao; Yulv Zhang

Conference Proceedings

Time-delay recurrent neural network for cross-lingual speech recognition

Advances in Intelligent Systems and Computing (2019) 752 341-348

DOI: 10.1007/978-981-10-8944-2_40

2Citations

6Readers

Get full text

Abstract

The speech recognition on low-resource language is a research hotspot. In this paper, on the basis of transfer learning, we propose a cross-lingual speech recognition system based on time-delay recurrent neural network. This network is comprised of deep belief time-delay neural network layer interleaved with long short-term memory recurrent neural network layer. We first train the whole merged network with a large labeled corpus of source language and afterward retrain the hidden layers using small target language corpus with both cross-entropy and lattice-free MMI objective function to enhance the recognition performance on the target language. Experiments show that the proposed system outperforms LSTM and DBN-DNN baseline system on THCHS30 corpus.

Author supplied keywords

Cite

CITATION STYLE

APA

Mao, X., & Zhang, Y. (2019). Time-delay recurrent neural network for cross-lingual speech recognition. In Advances in Intelligent Systems and Computing (Vol. 752, pp. 341–348). Springer Verlag. https://doi.org/10.1007/978-981-10-8944-2_40

Time-delay recurrent neural network for cross-lingual speech recognition

Abstract

Author supplied keywords

Cite

Register to see more suggestions