Linguistic ethnography: Identifying dominant word classes in text

Rada Mihalcea; Stephen Pulman

Conference Proceedings

Linguistic ethnography: Identifying dominant word classes in text

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 5449 LNCS 594-602

DOI: 10.1007/978-3-642-00382-0_48

20Citations

15Readers

Get full text

Abstract

In this paper, we propose a method for "linguistic ethnography" - a general mechanism for characterising texts with respect to the dominance of certain classes of words. Using humour as a case study, we explore the automatic learning of salient word classes, including semantic classes (e.g., person, animal), psycholinguistic classes (e.g., tentative, cause), and affective load (e.g., anger, happiness).We measure the reliability of the derived word classes and their associated dominance scores by showing significant correlation across different corpora. © Springer-Verlag Berlin Heidelberg 2009.

Cite

CITATION STYLE

APA

Mihalcea, R., & Pulman, S. (2009). Linguistic ethnography: Identifying dominant word classes in text. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5449 LNCS, pp. 594–602). https://doi.org/10.1007/978-3-642-00382-0_48

Linguistic ethnography: Identifying dominant word classes in text

Abstract

Cite

Register to see more suggestions