An Analysis of Euclidean vs. Graph-Based Framing for Bilingual Lexicon Induction from Word Embedding Spaces

2Citations
Citations of this article
49Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Much recent work in bilingual lexicon induction (BLI) views word embeddings as vectors in Euclidean space. As such, BLI is typically solved by finding a linear transformation that maps embeddings to a common space. Alternatively, word embeddings may be understood as nodes in a weighted graph. This framing allows us to examine a node's graph neighborhood without assuming a linear transform, and exploits new techniques from the graph matching optimization literature. These contrasting approaches have not been compared in BLI so far. In this work, we study the behavior of Euclidean versus graph-based approaches to BLI under differing data conditions and show that they complement each other when combined. We release our code at https://github.com/ kellymarchisio/euc-v-graph-bli.

Cite

CITATION STYLE

APA

Marchisio, K., Park, Y., Saad-Eldi, A., Alyakin, A., Duh, K., Priebe, C., & Koehn, P. (2021). An Analysis of Euclidean vs. Graph-Based Framing for Bilingual Lexicon Induction from Word Embedding Spaces. In Findings of the Association for Computational Linguistics, Findings of ACL: EMNLP 2021 (pp. 738–749). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.findings-emnlp.64

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free