Time-delay recurrent neural network for cross-lingual speech recognition

2Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The speech recognition on low-resource language is a research hotspot. In this paper, on the basis of transfer learning, we propose a cross-lingual speech recognition system based on time-delay recurrent neural network. This network is comprised of deep belief time-delay neural network layer interleaved with long short-term memory recurrent neural network layer. We first train the whole merged network with a large labeled corpus of source language and afterward retrain the hidden layers using small target language corpus with both cross-entropy and lattice-free MMI objective function to enhance the recognition performance on the target language. Experiments show that the proposed system outperforms LSTM and DBN-DNN baseline system on THCHS30 corpus.

Cite

CITATION STYLE

APA

Mao, X., & Zhang, Y. (2019). Time-delay recurrent neural network for cross-lingual speech recognition. In Advances in Intelligent Systems and Computing (Vol. 752, pp. 341–348). Springer Verlag. https://doi.org/10.1007/978-981-10-8944-2_40

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free