Crowdsourced hedge term disambiguation

Morgan Ulinski; Julia Hirschberg

Conference ProceedingsOPEN ACCESS

Crowdsourced hedge term disambiguation

LAW 2019 - 13th Linguistic Annotation Workshop, Proceedings of the Workshop (2019) 1-5

DOI: 10.18653/v1/w19-4001

2Citations

75Readers

Abstract

We address the issue of acquiring quality annotations of hedging words and phrases, linguistic phenomenona in which words, sounds, or other constructions are used to express ambiguity or uncertainty. Due to the limited availability of existing corpora annotated for hedging, linguists and other language scientists have been constrained as to the extent they can study this phenomenon. In this paper, we introduce a new method of acquiring hedging annotations via crowdsourcing, based on reformulating the task of labeling hedges as a simple word sense disambiguation task. We also introduce a new hedging corpus we have constructed by applying this method, a collection of forum posts annotated using Amazon Mechanical Turk. We found that the crowdsourced judgments we obtained had an inter-annotator agreement of 92.89% (Fleiss’ Kappa=0.751) and, when comparing a subset of these annotations to an expert-annotated gold standard, an accuracy of 96.65%.

Cite

CITATION STYLE

APA

Ulinski, M., & Hirschberg, J. (2019). Crowdsourced hedge term disambiguation. In LAW 2019 - 13th Linguistic Annotation Workshop, Proceedings of the Workshop (pp. 1–5). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w19-4001

Crowdsourced hedge term disambiguation

Abstract

Cite

Register to see more suggestions