We describe and motivate an unsupervised lexical error detection and correction algorithm and its application in a tool called Lexbar appearing as a query box on the Web browser toolbar or as a search engine interface. Lexbar accepts as user input candidate strings of English to be checked for acceptability and, where errors are detected, offers corrections. We introduce the notion of hybrid n-gram and extract these from BNC as the knowledgebase against which to compare user input. An extended notion of edit distance is used to identify most likely candidates for correcting detected errors. Results are illustrated with four types of errors.
Mendeley helps you to discover research relevant for your work.
CITATION STYLE
Tsao, N. L., & Wible, D. (2009). A Method for Unsupervised Broad-Coverage Lexical Error Detection and Correction. In Proceedings of the 4th Workshop on Innovative Use of NLP for Building Educational Applications, BEA 2009 (pp. 51–54). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1609843.1609851