The speech recognition on low-resource language is a research hotspot. In this paper, on the basis of transfer learning, we propose a cross-lingual speech recognition system based on time-delay recurrent neural network. This network is comprised of deep belief time-delay neural network layer interleaved with long short-term memory recurrent neural network layer. We first train the whole merged network with a large labeled corpus of source language and afterward retrain the hidden layers using small target language corpus with both cross-entropy and lattice-free MMI objective function to enhance the recognition performance on the target language. Experiments show that the proposed system outperforms LSTM and DBN-DNN baseline system on THCHS30 corpus.
CITATION STYLE
Mao, X., & Zhang, Y. (2019). Time-delay recurrent neural network for cross-lingual speech recognition. In Advances in Intelligent Systems and Computing (Vol. 752, pp. 341–348). Springer Verlag. https://doi.org/10.1007/978-981-10-8944-2_40
Mendeley helps you to discover research relevant for your work.