Open source German distant speech recognition: Corpus and acoustic model

Stephan Radeck-Arneth; Benjamin Milde; Arvid Lange; Evandro Gouvêa; Stefan Radomski; Max Mühlhäuser; Chris Biemann

Conference Proceedings

Open source German distant speech recognition: Corpus and acoustic model

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9302 480-488

DOI: 10.1007/978-3-319-24033-6_54

31Citations

20Readers

Get full text

Abstract

We present a new freely available corpus for German distant speech recognition and report speaker-independent word error rate (WER) results for two open source speech recognizers trained on this corpus. The corpus has been recorded in a controlled environment with three different microphones at a distance of one meter. It comprises 180 different speakers with a total of 36 hours of audio recordings. We show recognition results with the open source toolkit Kaldi (20.5% WER) and PocketSphinx (39.6% WER) and make a complete open source solution for German distant speech recognition possible.

Author supplied keywords

Cite

CITATION STYLE

APA

Radeck-Arneth, S., Milde, B., Lange, A., Gouvêa, E., Radomski, S., Mühlhäuser, M., & Biemann, C. (2015). Open source German distant speech recognition: Corpus and acoustic model. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9302, pp. 480–488). Springer Verlag. https://doi.org/10.1007/978-3-319-24033-6_54

Open source German distant speech recognition: Corpus and acoustic model

Abstract

Author supplied keywords

Cite

Register to see more suggestions