Probabilistic word vector and similarity based on dictionaries

Satoshi Suzuki

Conference Proceedings

Probabilistic word vector and similarity based on dictionaries

Suzuki S

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2003) 2588 562-572

DOI: 10.1007/3-540-36456-0_61

1Citations

1Readers

Get full text

Abstract

We propose a new method for computing the probabilistic vector expression of words based on dictionaries. This method provides a well-founded procedure based on stochastic process whose applicability is clear. The proposed method exploits the relationship between headwords and their explanatory notes in dictionaries. An explanatory note is a set of other words, each of which is expanded by its own explanatory note. This expansion is repeatedly applied, but even explanatory notes expanded infinitely can be computed under a simple assumption. The vector expression we obtain is a semantic expansion of the explanatory notes of words. We explain how to acquire the vector expression from these expanded explanatory notes. We also demonstrate a word similarity computation based on a Japanese dictionary and evaluate it in comparison with a known system based on TF · IDF. The results show the effectiveness and applicability of this probabilistic vector expression.

Cite

CITATION STYLE

APA

Suzuki, S. (2003). Probabilistic word vector and similarity based on dictionaries. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 2588, pp. 562–572). Springer Verlag. https://doi.org/10.1007/3-540-36456-0_61

Probabilistic word vector and similarity based on dictionaries

Abstract

Cite

Register to see more suggestions