Simple Yet Effective Synthetic Dataset Construction for Unsupervised Opinion Summarization

5Citations
Citations of this article
20Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Opinion summarization provides an important solution for summarizing opinions expressed among a large number of reviews. However, generating aspect-specific and general summaries is challenging due to the lack of annotated data. In this work, we propose two simple yet effective unsupervised approaches to generate both aspect-specific and general opinion summaries by training on synthetic datasets constructed with aspect-related review contents. Our first approach, Seed Words Based Leave-One-Out (SW-LOO), identifies aspect-related portions of reviews simply by exact-matching aspect seed words and outperforms existing methods by 3.4 ROUGE-L points on SPACE and 0.5 ROUGE-1 point on OPOSUM+ for aspect-specific opinion summarization. Our second approach, Natural Language Inference Based Leave-One-Out (NLI-LOO) identifies aspect-related sentences utilizing an NLI model in a more general setting without using seed words and outperforms existing approaches by 1.2 ROUGE-L points on SPACE for aspect-specific opinion summarization and remains competitive on other metrics.

Cite

CITATION STYLE

APA

Shen, M., Ma, J., Wang, S., Vyas, Y., Dixit, K., Ballesteros, M., & Benajiba, Y. (2023). Simple Yet Effective Synthetic Dataset Construction for Unsupervised Opinion Summarization. In EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2023 (pp. 1853–1866). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-eacl.142

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free