Predicting part-of-speech information about unknown words using statistical methods

Scott M. Thede

Conference ProceedingsOPEN ACCESS

Predicting part-of-speech information about unknown words using statistical methods

Thede S

Proceedings of the Annual Meeting of the Association for Computational Linguistics (1998) 2 1505-1507

DOI: 10.3115/980691.980821

8Citations

82Readers

Abstract

This paper examines the feasibility of using statistical methods to train a part-of-speech predictor for unknown words. By using statistical methods, without incorporating hand-crafted linguistic information, the predictor could be used with any language for which there is a large tagged training corpus. Encouraging results have been obtained by testing the predictor on unknown words from the Brown corpus. The relative value of information sources such as affixes and context is discussed. This part-of-speech predictor will be used in a part-of-speech tagger to handle out-of-lexicon words.

Cite

CITATION STYLE

APA

Thede, S. M. (1998). Predicting part-of-speech information about unknown words using statistical methods. In Proceedings of the Annual Meeting of the Association for Computational Linguistics (Vol. 2, pp. 1505–1507). Association for Computational Linguistics (ACL). https://doi.org/10.3115/980691.980821

Predicting part-of-speech information about unknown words using statistical methods

Abstract

Cite

Register to see more suggestions