Automatically Categorizing Written Texts by Author Gender

467Citations
Citations of this article
306Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The problem of automatically determining the gender of a document’s author would appear to be a more subtle problem than those of categorization by topic or authorship attribution. Nevertheless, it is shown that automated text categorization techniques can exploit combinations of simple lexical and syntactic features to infer the gender of the author of an unseen formal written document with approximately 80 per cent accuracy. The same techniques can be used to determine if a document is fiction or non-fiction with approximately 98 per cent accuracy. © 2002 Association for Literary & Linguistic Computing.

Cite

CITATION STYLE

APA

Koppel, M., Argamon, S., & Shimoni, A. R. (2002). Automatically Categorizing Written Texts by Author Gender. Literary and Linguistic Computing, 17(4), 401–412. https://doi.org/10.1093/llc/17.4.401

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free