Investigating the global semantic impact of speech recognition error on spoken content collections

Martha Larson; Manos Tsagkias; Jiyin He; Maarten De Rijke

Conference Proceedings

Investigating the global semantic impact of speech recognition error on spoken content collections

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 5478 LNCS 755-760

DOI: 10.1007/978-3-642-00958-7_80

4Citations

6Readers

Get full text

Abstract

Errors in speech recognition transcripts have a negative impact on effectiveness of content-based speech retrieval and present a particular challenge for collections containing conversational spoken content. We propose a Global Semantic Distortion (GSD) metric that measures the collection-wide impact of speech recognition error on spoken content retrieval in a query-independent manner. We deploy our metric to examine the effects of speech recognition substitution errors. First, we investigate frequent substitutions, cases in which the recognizer habitually mis-transcribes one word as another. Although habitual mistakes have a large global impact, the long tail of rare substitutions has a more damaging effect. Second, we investigate semantically similar substitutions, cases in which the word spoken and the word recognized do not diverge radically in meaning. Similar substitutions are shown to have slightly less global impact than semantically dissimilar substitutions. © Springer-Verlag Berlin Heidelberg 2009.

Cite

CITATION STYLE

APA

Larson, M., Tsagkias, M., He, J., & De Rijke, M. (2009). Investigating the global semantic impact of speech recognition error on spoken content collections. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5478 LNCS, pp. 755–760). https://doi.org/10.1007/978-3-642-00958-7_80

Investigating the global semantic impact of speech recognition error on spoken content collections

Abstract

Cite

Register to see more suggestions