We use character-based statistical machine translation in order to correct user search queries in the e-commerce domain. The training data is automatically extracted from event logs where users re-issue their search queries with potentially corrected spelling within the same session. We show results on a test set which was annotated by humans and compare against online autocorrection capabilities of three additional web sites. Overall, the methods presented in this paper outperform fully productized spellchecking and autocorrection services in terms of accuracy and FI score. We also propose novel evaluation steps based on retrieved search results of the corrected queries in terms of quantity and relevance.
CITATION STYLE
Hasan, S., Heger, C., & Mansour, S. (2015). Spelling correction of user search queries through statistical machine translation. In Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing (pp. 451–460). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/d15-1051
Mendeley helps you to discover research relevant for your work.