A single word is not enough: Ranking multiword expressions using distributional semantics

Martin Riedl; Chris Biemann

Conference ProceedingsOPEN ACCESS

A single word is not enough: Ranking multiword expressions using distributional semantics

Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing (2015) 2430-2440

DOI: 10.18653/v1/d15-1290

17Citations

109Readers

Abstract

We present a new unsupervised mechanism, which ranks word n-grams according to their multiwordness. It heavily relies on a new uniqueness measure that computes, based on a distributional thesaurus, how often an n-gram could be replaced in context by a single-worded term. In addition with a downweighting mechanism for incomplete terms this forms a new measure called DRUID. Results show large improvements on two small test sets over competitive baselines. We demonstrate the scalability of the method to large corpora, and the independence of the measure of shallow syntactic filtering.

Cite

CITATION STYLE

APA

Riedl, M., & Biemann, C. (2015). A single word is not enough: Ranking multiword expressions using distributional semantics. In Conference Proceedings - EMNLP 2015: Conference on Empirical Methods in Natural Language Processing (pp. 2430–2440). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/d15-1290

A single word is not enough: Ranking multiword expressions using distributional semantics

Abstract

Cite

Register to see more suggestions