How to add word classes to the Kaldi speech recognition toolkit

Axel Horndasch; Caroline Kaufhold; Elmar Nöth

Book Chapter

How to add word classes to the Kaldi speech recognition toolkit

Springer Verlag, (2016), 486-494

DOI: 10.1007/978-3-319-45510-5_56

4Citations

15Readers

Get full text

Abstract

The paper explains and illustrates how the concept of word classes can be added to the widely used open-source speech recognition toolkit Kaldi. The suggested extensions to existing Kaldi recipes are limited to the word-level grammar (G) and the pronunciation lexicon (L) models. The implementation to modify the weighted finite state transducers employed in Kaldi makes use of the OpenFST library. In experiments on small and mid-sized corpora with vocabulary sizes of 1.5 K and 5.5 K respectively a slight improvement of the word error rate is observed when the approach is tested with (hand-crafted) word classes. Furthermore it is shown that the introduction of sub-word unit models for open word classes can help to robustly detect and classify out-of-vocabulary words without impairing word recognition accuracy.

Author supplied keywords

Cite

CITATION STYLE

APA

Horndasch, A., Kaufhold, C., & Nöth, E. (2016). How to add word classes to the Kaldi speech recognition toolkit. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9924 LNCS, pp. 486–494). Springer Verlag. https://doi.org/10.1007/978-3-319-45510-5_56

How to add word classes to the Kaldi speech recognition toolkit

Abstract

Author supplied keywords

Cite

Register to see more suggestions