Recognition of multiple language voice navigation queries in traffic situations

Gellért Sárosi; Tamás Mozsolics; Balázs Tarján; András Balog; Péter Mihajlik; Tibor Fegyó

Conference Proceedings

Recognition of multiple language voice navigation queries in traffic situations

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 6800 LNCS 199-213

DOI: 10.1007/978-3-642-25775-9_20

2Citations

3Readers

Get full text

Abstract

This paper introduces our work and results related to a multiple language continuous speech recognition task. The aim was to design a system that introduces tolerable amount of recognition errors for point of interest words in voice navigational queries even in the presence of real-life traffic noise. Additional challenges were that no task-specific training databases were available for language and acoustic modeling. Instead, general purpose acoustic database were obtained and (probabilistic) context free grammars were constructed for the acoustic and language models, respectively. Public pronunciation lexicon was used for the English language, whereas rule- and exception dictionary based pronunciation modeling was applied for French, German, Italian, Spanish and Hungarian. For the last four languages the classical phoneme-based pronunciation modeling approach was compared to grapheme-based pronunciation modeling technique, as well. Noise robustness was addressed by applying various feature extraction methods. The results show that achieving high word recognition accuracy is feasible if cooperative speakers can be assumed. © 2011 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Sárosi, G., Mozsolics, T., Tarján, B., Balog, A., Mihajlik, P., & Fegyó, T. (2011). Recognition of multiple language voice navigation queries in traffic situations. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6800 LNCS, pp. 199–213). https://doi.org/10.1007/978-3-642-25775-9_20

Recognition of multiple language voice navigation queries in traffic situations

Abstract

Author supplied keywords

Cite

Register to see more suggestions