A two-stage sieve approach for quote attribution

45Citations
Citations of this article
106Readers
Mendeley users who have this article in their library.

Abstract

We present a deterministic sieve-based system for attributing quotations in literary text and a new dataset: QuoteLi31. Quote attribution, determining who said what in a given text, is important for tasks like creating dialogue systems, and in newer areas like computational literary studies, where it creates opportunities to analyze novels at scale rather than only a few at a time. We release QuoteLi3, which contains more than 6,000 annotations linking quotes to speaker mentions and quotes to speaker entities, and introduce a new algorithm for quote attribution. Our twostage algorithm first links quotes to mentions, then mentions to entities. Using two stages encapsulates difficult sub-problems and improves system performance. The modular design allows us to tune either for overall performance or for the high precision appropriate for many use cases. Our system achieves an average F-score of 87.5% across three novels, outperforming previous systems, and can be tuned for precision of 90.4% at a recall of 65.1%.

Cite

CITATION STYLE

APA

Muzny, G., Fang, M., Chang, A. X., & Jurafsky, D. (2017). A two-stage sieve approach for quote attribution. In 15th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2017 - Proceedings of Conference (Vol. 1, pp. 460–470). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/e17-1044

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free