Abstract
We induce semantic association networks from translation relations in parallel corpora. The resulting semantic spaces are encoded in a single reference language, which ensures cross-language comparability. As our main contribution, we cluster the obtained (crosslingually comparable) lexical semantic spaces. We find that, in our sample of languages, lexical semantic spaces largely coincide with genealogical relations. To our knowledge, this constitutes the first large-scale quantitative lexical semantic typology that is completely unsupervised, bottom-up, and datadriven. Our results may be important for the decision which multilingual resources to integrate in a semantic evaluation task.
Cite
CITATION STYLE
Eger, S., Schenk, N., & Mehler, A. (2015). Towards semantic language classification: Inducing and clustering semantic association networks from europarl. In Proceedings of the 4th Joint Conference on Lexical and Computational Semantics, *SEM 2015 (pp. 127–136). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s15-1014
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.