Divide and extract - Disentangling clause splitting and proposition extraction

1Citations
Citations of this article
62Readers
Mendeley users who have this article in their library.

Abstract

Proposition extraction from sentences is an important task for information extraction systems. Evaluation of such systems usually conflates two aspects: splitting complex sentences into clauses and the extraction of propositions. It is thus difficult to independently determine the quality of the proposition extraction step. We create a manually annotated proposition dataset from sentences taken from restaurant reviews that distinguishes between clauses that need to be split and those that do not. The resulting proposition evaluation dataset allows us to independently compare the performance of proposition extraction systems on simple and complex clauses. Although performance drastically drops on more complex sentences, we show that the same systems perform best on both simple and complex clauses. Furthermore, we show that specific kinds of subordinate clauses pose difficulties to most systems.

Cite

CITATION STYLE

APA

Gold, D., & Zesch, T. (2019). Divide and extract - Disentangling clause splitting and proposition extraction. In International Conference Recent Advances in Natural Language Processing, RANLP (Vol. 2019-September, pp. 399–408). Incoma Ltd. https://doi.org/10.26615/978-954-452-056-4_047

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free