Exploring numeracy in word embeddings

Aakanksha Naik; Abhilasha Ravichander; Carolyn Rose; Eduard Hovy

Conference Proceedings (Open Access)

ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (2020) 3374-3380

DOI: 10.18653/v1/p19-1329

Abstract

Word embeddings are now pervasive across NLP subfields as the de facto method of forming text representations. In this work, we show that existing embedding models are inadequate at constructing representations that capture salient aspects of mathematical meaning for numbers, which is important for language understanding. Numbers are ubiquitous and frequently appear in text. Inspired by cognitive studies on how humans perceive numbers, we develop an analysis framework to test how well word embeddings capture two essential properties of numbers: magnitude (e.g. 3<4) and numeration (e.g. 3=three). Our experiments reveal that most models capture an approximate notion of magnitude, but are inadequate at capturing numeration. We hope that our observations provide a starting point for the development of methods which better capture numeracy in NLP systems.
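
To make the two properties concrete, the following is a minimal sketch of how magnitude and numeration could be probed with cosine similarity. It is an illustration under stated assumptions, not the test suite used in the paper: the function names (`magnitude_check`, `numeration_check`) are hypothetical, and the two-dimensional vectors in `TOY` are made up for the example rather than taken from any trained embedding model.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def magnitude_check(x, closer, farther, emb):
    """Magnitude probe (hypothetical): is x more similar to the
    numerically closer number than to the numerically farther one?"""
    return cosine(emb[x], emb[closer]) > cosine(emb[x], emb[farther])

def numeration_check(digit, word, distractors, emb):
    """Numeration probe (hypothetical): among the candidate number
    words, is the correct word form the nearest neighbour of the digit?"""
    candidates = [word] + list(distractors)
    nearest = max(candidates, key=lambda w: cosine(emb[digit], emb[w]))
    return nearest == word

# Made-up toy vectors, for illustration only (not real embeddings).
TOY = {
    "3": [1.0, 0.30],
    "4": [1.0, 0.40],
    "100": [1.0, 3.0],
    "three": [0.9, 0.28],
    "four": [0.9, 0.50],
    "hundred": [0.2, 1.0],
}

if __name__ == "__main__":
    print("magnitude (3 closer to 4 than to 100):",
          magnitude_check("3", "4", "100", TOY))
    print("numeration (3 = three):",
          numeration_check("3", "three", ["four", "hundred"], TOY))
```

With a real model, `TOY` would be replaced by a lookup into pretrained vectors, and each probe would be averaged over many number triples or digit-word pairs to give an accuracy score.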

Cite

Naik, A., Ravichander, A., Rose, C., & Hovy, E. (2020). Exploring numeracy in word embeddings. In ACL 2019 - 57th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (pp. 3374–3380). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p19-1329
