Spoken document retrieval based on approximated sequence alignment

Pere R. Comas; Jordi Turmo

Conference Proceedings

Spoken document retrieval based on approximated sequence alignment

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2008) 5246 LNAI 285-292

DOI: 10.1007/978-3-540-87391-4_37

5Citations

10Readers

Get full text

Abstract

This paper presents a new approach to spoken document information retrieval for spontaneous speech corpora. The classical approach to this problem is the use of an automatic speech recognizer (ASR) combined with standard information retrieval techniques. However, ASRs tend to produce transcripts of spontaneous speech with significant word error rate, which is a drawback for standard retrieval techniques. To overcome such a limitation, our method is based on an approximated sequence alignment algorithm to search "sounds like" sequences. Our approach does not depend on extra information from the ASR and outperforms up to 7 points the precision of state-of-the-art techniques in our experiments. © 2008 Springer-Verlag Berlin Heidelberg.

Cite

CITATION STYLE

APA

Comas, P. R., & Turmo, J. (2008). Spoken document retrieval based on approximated sequence alignment. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5246 LNAI, pp. 285–292). https://doi.org/10.1007/978-3-540-87391-4_37

Spoken document retrieval based on approximated sequence alignment

Abstract

Cite

Register to see more suggestions