Automatic spelling correction for resource-scarce languages using deep learning

Pravallika Etoori; Manoj Chinnakotla; Radhika Mamidi

Conference ProceedingsOPEN ACCESS

Automatic spelling correction for resource-scarce languages using deep learning

ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Student Research Workshop (2018) 146-152

DOI: 10.18653/v1/p18-3021

68Citations

184Readers

Abstract

Spelling correction is a well-known task in Natural Language Processing (NLP). Automatic spelling correction is important for many NLP applications like web search engines, text summarization, sentiment analysis etc. Most approaches use parallel data of noisy and correct word mappings from different sources as training data for automatic spelling correction. Indic languages are resourcescarce and do not have such parallel data due to low volume of queries and nonexistence of such prior implementations. In this paper, we show how to build an automatic spelling corrector for resourcescarce languages. We propose a sequenceto- sequence deep learning model which trains end-to-end. We perform experiments on synthetic datasets created for Indic languages, Hindi and Telugu, by incorporating the spelling mistakes committed at character level. A comparative evaluation shows that our model is competitive with the existing spell checking and correction techniques for Indic languages.

Cite

CITATION STYLE

APA

Etoori, P., Chinnakotla, M., & Mamidi, R. (2018). Automatic spelling correction for resource-scarce languages using deep learning. In ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Student Research Workshop (pp. 146–152). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p18-3021

Automatic spelling correction for resource-scarce languages using deep learning

Abstract

Cite

Register to see more suggestions