The traditional approach to spoken document retrieval (SDR) uses an automatic speech recognizer (ASR) in combination with a word-based information retrieval method. This approach has shown only limited accuracy, partly because ASR systems tend to produce transcriptions of spontaneous speech with significant word error rates. To overcome this limitation, we propose a method that uses word and phonetic-code representations in combination. The idea is to reduce the impact of transcription errors on the processing of some (presumably complex) queries by representing words with similar pronunciations through the same phonetic code. Experimental results on the CLEF-CLSR-2007 corpus are encouraging: the proposed hybrid method improved the mean average precision and the number of retrieved relevant documents over the traditional word-based approach by 3% and 7%, respectively. © 2011 Springer-Verlag.
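The abstract does not specify which phonetic coding scheme is used; as a minimal sketch of the general idea, the classic Soundex algorithm maps similarly pronounced words (e.g., misrecognized variants in an ASR transcript) to the same code, so a phonetic index can match a query term against an erroneous transcription:

```python
def soundex(word: str) -> str:
    """Classic Soundex: first letter + up to three digits for consonant groups.

    Words with similar pronunciations (e.g., "Robert" / "Rupert") collapse
    to the same code, which is the property the hybrid representation exploits.
    """
    codes = {**dict.fromkeys("bfpv", "1"), **dict.fromkeys("cgjkqsxz", "2"),
             **dict.fromkeys("dt", "3"), "l": "4",
             **dict.fromkeys("mn", "5"), "r": "6"}
    word = word.lower()
    first = word[0].upper()
    digits = []
    prev = codes.get(word[0], "")
    for ch in word[1:]:
        if ch in "hw":
            continue  # h/w are transparent: they do not break a run of equal codes
        code = codes.get(ch, "")
        if code and code != prev:
            digits.append(code)
        prev = code  # vowels reset the run, allowing a repeated code to recur
    return (first + "".join(digits) + "000")[:4]  # pad/truncate to 4 characters

# "Robert" and "Rupert" both map to R163, so a phonetic index treats them as a match.
print(soundex("Robert"), soundex("Rupert"))
```

In a hybrid SDR setting along the lines the abstract describes, each document (and query) would be indexed twice, once over words and once over such codes, and the two retrieval scores combined.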
Reyes-Barragán, A., Montes-Y-Gómez, M., & Villaseñor-Pineda, L. (2011). Combining word and phonetic-code representations for spoken document retrieval. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6609 LNCS, pp. 458–466). https://doi.org/10.1007/978-3-642-19437-5_38