A Question of Style: A Dataset for Analyzing Formality on Different Levels

3Citations
Citations of this article
20Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Accounting for different degrees of formality is crucial for producing contextually appropriate language. To assist NLP applications concerned with this problem and formality analysis in general, we present the first dataset of sentences from a wide range of genres assessed on a continuous informal-formal scale via comparative judgments. It is the first corpus with a comprehensive perspective on German sentence-level formality overall. We compare machine learning models for formality scoring, a task we treat as a regression problem, on our dataset. Finally, we investigate the relation between sentence- and document-level formality and evaluate leveraging sentence-based annotations for assessing formality on documents.

Cite

CITATION STYLE

APA

Eder, E., Krieg-Holz, U., & Wiegand, M. (2023). A Question of Style: A Dataset for Analyzing Formality on Different Levels. In EACL 2023 - 17th Conference of the European Chapter of the Association for Computational Linguistics, Findings of EACL 2023 (pp. 568–581). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2023.findings-eacl.42

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free