Abstract
There are several NLP systems whose accuracy depends crucially on finding misspellings fast. However, the classical approach is based on a quadratic time algorithm with 80% coverage. We present a novel algorithm for misspelling detection, which runs in constant time and improves the coverage to more than 96%. We use this algorithm together with a cross document coreference system in order to find proper name misspellings. The experiments confirmed significant improvement over the state of the art.
Cite
CITATION STYLE
Popescu, O., & Vo, N. P. A. (2014). Fast and accurate misspelling correction in large corpora. In EMNLP 2014 - 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference (pp. 1634–1642). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/d14-1171
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.