More than Words: Using Token Context to Improve Canonicalization of Historical German

  • Jurish B

Abstract

Historical text presents numerous challenges for contemporary natural language processing techniques. In particular, the absence of consistent orthographic conventions in historical text presents difficulties for any system requiring reference to a fixed lexicon accessed by orthographic form, such as information retrieval systems, part-of-speech taggers, simple word stemmers, or more sophisticated morphological analyzers.

Jurish, B. (2010). More than Words: Using Token Context to Improve Canonicalization of Historical German. Journal for Language Technology and Computational Linguistics, 25(1), 23–39. https://doi.org/10.21248/jlcl.25.2010.127
