Compound or Term Features? Analyzing Salience in Predicting the Difficulty of German Noun Compounds across Domains

2Citations
Citations of this article
41Readers
Mendeley users who have this article in their library.

Abstract

Predicting the difficulty of domain-specific vocabulary is an important task towards a better understanding of a domain, and to enhance the communication between lay people and experts. We investigate German closed noun compounds and focus on the interaction of compound-based lexical features (such as frequency and productivity) and terminologybased features (contrasting domain-specific and general language) across word representations and classifiers. Our prediction experiments complement insights from classification using (a) manually designed features to characterise termhood and compound formation and (b) compound and constituent word embeddings. We find that for a broad binary distinction into easy vs. difficult general-language compound frequency is sufficient, but for a more fine-grained four-class distinction it is crucial to include contrastive termhood features and compound and constituent features.

Cite

CITATION STYLE

APA

Hatty, A., Bettinger, J., Dorna, M., Kuhn, J., & Im Walde, S. S. (2021). Compound or Term Features? Analyzing Salience in Predicting the Difficulty of German Noun Compounds across Domains. In *SEM 2021 - 10th Conference on Lexical and Computational Semantics, Proceedings of the Conference (pp. 252–262). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.starsem-1.24

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free