Language adaptive multilingual CTC speech recognition

Markus Müller; Sebastian Stüker; Alex Waibel

Conference Proceedings


Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2017) 10458 LNAI 473-482

DOI: 10.1007/978-3-319-66429-3_47

13 citations

10 readers


Abstract

Recent work has demonstrated that speech recognition systems can reach human parity. While much research targets resource-rich languages like English, there is a long tail of languages for which no speech recognition systems exist yet. The major obstacle to building systems for new languages is the lack of available resources. Several methods have been proposed for building systems in low-resource conditions by using data from additional source languages during training. While it has been shown that DNN/HMM hybrid setups trained in low-resource conditions benefit from additional data, we propose a similar technique using sequence-based neural network acoustic models with a Connectionist Temporal Classification (CTC) loss function. We demonstrate that setups with multilingual phone sets benefit from the addition of Language Feature Vectors (LFVs).
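
The two ingredients named in the abstract can be illustrated with a minimal sketch. It rests on assumptions the abstract does not confirm: that an LFV is a fixed-length vector appended to every acoustic frame of an utterance, and that label 0 is the CTC blank. The function names `append_lfv` and `ctc_collapse` and all numeric values are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence


def append_lfv(frames: Sequence[Sequence[float]],
               lfv: Sequence[float]) -> List[List[float]]:
    """Append the same language feature vector to every acoustic frame.

    Assumption: the LFV is constant over the utterance and is simply
    concatenated to each frame's features.
    """
    return [list(frame) + list(lfv) for frame in frames]


def ctc_collapse(path: Sequence[int], blank: int = 0) -> List[int]:
    """Map a frame-level CTC path to a label sequence.

    Standard CTC rule: merge consecutive repeated labels, then drop blanks.
    """
    out: List[int] = []
    prev = None
    for label in path:
        if label != prev and label != blank:
            out.append(label)
        prev = label
    return out


# Toy data: three 2-dimensional frames and a 2-dimensional LFV (made up).
frames = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
lfv = [1.0, 0.0]
augmented = append_lfv(frames, lfv)

# A frame-level path over a shared (multilingual) phone inventory.
# The blank between the two 3s keeps them as separate labels.
phones = ctc_collapse([0, 3, 3, 0, 3, 5, 5, 0])
```

Here `augmented` holds three 4-dimensional frames, and `phones` is the collapsed phone sequence that a CTC-trained model would be scored against.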


Cite (APA)

Müller, M., Stüker, S., & Waibel, A. (2017). Language adaptive multilingual CTC speech recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10458 LNAI, pp. 473–482). Springer Verlag. https://doi.org/10.1007/978-3-319-66429-3_47
