Fast and accurate misspelling correction in large corpora

4Citations
Citations of this article
99Readers
Mendeley users who have this article in their library.
Get full text

Abstract

There are several NLP systems whose accuracy depends crucially on finding misspellings fast. However, the classical approach is based on a quadratic time algorithm with 80% coverage. We present a novel algorithm for misspelling detection, which runs in constant time and improves the coverage to more than 96%. We use this algorithm together with a cross document coreference system in order to find proper name misspellings. The experiments confirmed significant improvement over the state of the art.

Cite

CITATION STYLE

APA

Popescu, O., & Vo, N. P. A. (2014). Fast and accurate misspelling correction in large corpora. In EMNLP 2014 - 2014 Conference on Empirical Methods in Natural Language Processing, Proceedings of the Conference (pp. 1634–1642). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/d14-1171

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free