Bigrams of syntactic labels for authorship discrimination of short texts

  • Hirst G
  • Feiguina O
  • 61 readers (Mendeley users who have this article in their library)
  • 58 citations of this article

Abstract

We present a method for authorship discrimination that is based on the frequency of bigrams of syntactic labels that arise from partial parsing of the text. We show that this method, alone or combined with other classification features, achieves a high accuracy on discrimination of the work of Anne and Charlotte Brontë, which is very difficult to do by traditional methods. Moreover, high accuracies are achieved even on fragments of text little more than 200 words long.
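
The core feature described in the abstract, the frequency of bigrams of syntactic labels, can be illustrated with a minimal sketch. It assumes the partial parser has already produced a flat sequence of labels for a text fragment; the function name `label_bigram_frequencies` and the label names (`NP`, `VP`, `PP`) are hypothetical and are not taken from the article, which may define and normalize its features differently.

```python
from collections import Counter
from typing import Dict, List, Tuple


def label_bigram_frequencies(labels: List[str]) -> Dict[Tuple[str, str], float]:
    """Return the relative frequency of each adjacent pair of syntactic labels.

    Assumption: `labels` is the label sequence emitted by some partial parser
    for one text fragment. Normalizing by the number of bigrams is one
    plausible choice, not necessarily the one used in the article.
    """
    bigrams = list(zip(labels, labels[1:]))
    if not bigrams:
        # A fragment with fewer than two labels has no bigrams.
        return {}
    counts = Counter(bigrams)
    total = len(bigrams)
    return {pair: n / total for pair, n in counts.items()}


# Hypothetical label stream; the label inventory is illustrative only.
example_labels = ["NP", "VP", "NP", "PP", "NP", "VP", "NP"]
features = label_bigram_frequencies(example_labels)
```

The resulting dictionary could serve as a feature vector for any standard classifier, alone or alongside other features, as the abstract suggests.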


Authors

  • Graeme Hirst

  • Ol'ga Feiguina
