Parameter free hierarchical graph-based clustering for analyzing continuous word embeddings

3Citations
Citations of this article
96Readers
Mendeley users who have this article in their library.

Abstract

Word embeddings are high-dimensional vector representations of words and are thus difficult to interpret. In order to deal with this, we introduce an unsupervised parameter free method for creating a hierarchical graphical clustering of the full ensemble of word vectors and show that this structure is a geometrically meaningful representation of the original relations between the words. This newly obtained representation can be used for better understanding and thus improving the embedding algorithm and exhibits semantic meaning, so it can also be utilized in a variety of language processing tasks like categorization or measuring similarity.

Cite

CITATION STYLE

APA

Trost, T. A., & Klakow, D. (2020). Parameter free hierarchical graph-based clustering for analyzing continuous word embeddings. In Proceedings of TextGraphs@ACL 2017: The 11th Workshop on Graph-Based Methods for Natural Language Processing (pp. 30–38). Association for Computational Linguistics. https://doi.org/10.18653/v1/w17-2404

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free