Deep generative model for joint alignment andword representation

Miguel Rios; Wilker Aziz; Khalil Sima'an

Conference ProceedingsOPEN ACCESS

Deep generative model for joint alignment andword representation

NAACL HLT 2018 - 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference (2018) 1 1011-1023

DOI: 10.18653/v1/n18-1092

3Citations

105Readers

Abstract

This work exploits translation data as a source of semantically relevant learning signal for models of word representation. In particular, we exploit equivalence through translation as a form of distributional context and jointly learn how to embed and align with a deep generative model. Our EMBEDALIGN model embeds words in their complete observed context and learns by marginalisation of latent lexical alignments. Besides, it embeds words as posterior probability densities, rather than point estimates, which allows us to compare words in context using a measure of overlap between distributions (e.g. KL divergence). We investigate our model's performance on a range of lexical semantics tasks achieving competitive results on several standard benchmarks including natural language inference, paraphrasing, and text similarity.

Cite

CITATION STYLE

APA

Rios, M., Aziz, W., & Sima’an, K. (2018). Deep generative model for joint alignment andword representation. In NAACL HLT 2018 - 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference (Vol. 1, pp. 1011–1023). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/n18-1092

Deep generative model for joint alignment andword representation

Abstract

Cite

Register to see more suggestions