Predicting the compositionality of nominal compounds: Giving word embeddings a hard time

Silvio Cordeiro; Carlos Ramisch; Marco Idiart; Aline Villavicencio

Conference ProceedingsOPEN ACCESS

Predicting the compositionality of nominal compounds: Giving word embeddings a hard time

54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers (2016) 4 1986-1997

DOI: 10.18653/v1/p16-1187

34Citations

122Readers

Abstract

Distributional semantic models (DSMs) are often evaluated on artificial similarity datasets containing single words or fully compositional phrases. We present a large-scale multilingual evaluation of DSMs for predicting the degree of semantic compositionality of nominal compounds on 4 datasets for English and French. We build a total of 816 DSMs and perform 2,856 evaluations using word2vec, GloVe, and PPMI-based models. In addition to the DSMs, we compare the impact of different parameters, such as level of corpus preprocessing, context window size and number of dimensions. The results obtained have a high correlation with human judgments, being comparable to or outperforming the state of the art for some datasets (Spearman's ρ=.82 for the Reddy dataset).

Cite

CITATION STYLE

APA

Cordeiro, S., Ramisch, C., Idiart, M., & Villavicencio, A. (2016). Predicting the compositionality of nominal compounds: Giving word embeddings a hard time. In 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016 - Long Papers (Vol. 4, pp. 1986–1997). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p16-1187

Predicting the compositionality of nominal compounds: Giving word embeddings a hard time

Abstract

Cite

Register to see more suggestions