Automatically Categorizing Written Texts by Author Gender

  • Koppel M
  • Argamon S
  • Shimoni A
  • 209


    Mendeley users who have this article in their library.
  • 270


    Citations of this article.


The problem of automatically determining the gender of a document's author would appear to be a more subtle problem than those of categorization by topic or authorship attribution. Nevertheless, it is shown that automated text categorization techniques can exploit combinations of simple lexical and syntactic features to infer the gender of the author of an unseen formal written document with approximately 80 per cent accuracy. The same techniques can be used to determine if a document is fiction or non-fiction with approximately 98 per cent accuracy.

Get free article suggestions today

Mendeley saves you time finding and organizing research

Sign up here
Already have an account ?Sign in

Find this document


  • Moshe Koppel

  • Shlomo Argamon

  • Anat Rachel Shimoni

Cite this document

Choose a citation style from the tabs below

Save time finding and organizing research with Mendeley

Sign up for free