A method of extracting related words using standardized mutual information

2Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Techniques of automatic extraction of related words are of great importance in many applications such as query expansion and automatic thesaurus construction. In this paper, a method of extracting related words is proposed basing on the statistical information about the co-occurrences of words from huge corpora. The mutual information is one of such statistical measures and has been used for application mainly in natural language processing. A drawback is, however, the mutual information depends mainly on frequencies of words. To overcome this difficulty, we propose as a new measure a normalize deviation of mutual information. We also reveal a correspondence between word ambiguity and related words using word relation graphs constructed using this measure. © Springer-Verlag Berlin Heidelberg 2003.

Cite

CITATION STYLE

APA

Sugimachi, T., Ishino, A., Takeda, M., & Matsuo, F. (2003). A method of extracting related words using standardized mutual information. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2843, 478–485. https://doi.org/10.1007/978-3-540-39644-4_49

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free