Random indexing distributional semantic models for Croatian language

Vedrana Janković; Jan Šnajder; Bojana Dalbelo Bašić

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2011) 6836 LNAI 411-418

DOI: 10.1007/978-3-642-23538-2_52

Abstract

Distributional semantic models (DSMs) model semantic relations between expressions by comparing the contexts in which these expressions occur. This paper presents an extensive evaluation of distributional semantic models for the Croatian language. We focus on random indexing models, an efficient and scalable approach to building DSMs. We build a number of models with different parameters (dimension, context type, and similarity measure) and compare them against human semantic similarity judgments. Our results indicate that even low-dimensional random indexing models may outperform raw frequency models, and that the choice of similarity measure matters most. © 2011 Springer-Verlag.
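To make the parameters named in the abstract concrete, here is a minimal sketch of random indexing in its commonly described form: each word gets a sparse random index vector, and a word's context vector is the sum of the index vectors of its neighbours. This is not the authors' implementation. The function names, the toy corpus, the window-based context, the default values for `dim`, `nonzero` and `window`, and the use of cosine as the similarity measure are all assumptions chosen for illustration.

```python
import math
import random
from collections import defaultdict


def make_index_vector(dim, nonzero, rng):
    """Return a sparse ternary vector: `nonzero` random positions set to +1 or -1."""
    vec = [0] * dim
    for pos in rng.sample(range(dim), nonzero):
        vec[pos] = rng.choice((-1, 1))
    return vec


def build_random_index_model(sentences, dim=300, nonzero=6, window=2, seed=0):
    """Accumulate a context vector for every word in the tokenized `sentences`."""
    rng = random.Random(seed)
    index_vectors = {}
    context_vectors = defaultdict(lambda: [0] * dim)
    for tokens in sentences:
        # Assign a fixed random index vector to each new word.
        for token in tokens:
            if token not in index_vectors:
                index_vectors[token] = make_index_vector(dim, nonzero, rng)
        # Add the index vectors of neighbours within the window to the target.
        for i, target in enumerate(tokens):
            start, end = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(start, end):
                if j == i:
                    continue
                vec = context_vectors[target]
                for k, value in enumerate(index_vectors[tokens[j]]):
                    vec[k] += value
    return dict(context_vectors)


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


# Toy corpus, invented for this example only.
corpus = [["mačka", "spava"], ["pas", "spava"], ["auto", "vozi"]]
model = build_random_index_model(corpus, dim=50, nonzero=4)
print(cosine(model["mačka"], model["pas"]))
```

The dimension is fixed up front and does not grow with the vocabulary, which is the usual reason random indexing is called scalable. Swapping `cosine` for another measure, or the window for a different context definition, would correspond to varying the other two parameters the abstract lists.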

Cite

Janković, V., Šnajder, J., & Dalbelo Bašić, B. (2011). Random indexing distributional semantic models for Croatian language. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6836 LNAI, pp. 411–418). https://doi.org/10.1007/978-3-642-23538-2_52
