Abstract
We use machine learners trained on a combination of acoustic confidence and pragmatic plausibility features computed from dialogue context to predict the accuracy of incoming n-best recognition hypotheses to a spoken dialogue system. Our best results show a 25% weighted f-score improvement over a baseline system that implements a “grammar-switching” approach to context-sensitive speech recognition.
Cite
CITATION STYLE
Gabsdil, M., & Lemon, O. (2004). Combining acoustic and pragmatic features to predict recognition performance in spoken dialogue systems. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (pp. 343–350). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1218955.1218999
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.