Improving part-of-speech tagging of historical text by first translating to modern text

Abstract

We explore the task of automatically assigning syntactic tags (known as part-of-speech tags), such as Noun and Verb, to words in seventeenth-century Dutch text. Tools exist for performing this task on modern texts, but they perform poorly on historical texts because the language has changed. We test several methods for translating the words in the historical text to modern equivalents before applying the tag assignment tools. We show that this additional translation step improves the quality of the automatic syntactic analysis. Further improvements are possible when the lexicons and text collections used for developing the translation process are extended in size.
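The translate-then-tag idea described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the tiny lexicon, the toy dictionary-based tagger, the tag names other than Noun and Verb, and the function names are all hypothetical and are not the paper's actual resources, tools, or translation methods.

```python
# Hypothetical word-for-word lexicon: historical spelling -> modern spelling.
# (Illustrative entries only; not taken from the paper.)
HISTORICAL_TO_MODERN = {
    "ick": "ik",
    "ende": "en",
    "hy": "hij",
    "mensch": "mens",
}

# Toy stand-in for a tagger built for modern text: it only knows modern forms.
MODERN_TAGS = {
    "ik": "Pron",
    "en": "Conj",
    "hij": "Pron",
    "mens": "Noun",
    "zie": "Verb",
}


def translate(tokens):
    """Replace each historical token by its modern equivalent, if one is known."""
    return [HISTORICAL_TO_MODERN.get(t.lower(), t) for t in tokens]


def tag(tokens):
    """Tag tokens with the modern-text tagger; unknown words get 'UNK'."""
    return [(t, MODERN_TAGS.get(t.lower(), "UNK")) for t in tokens]


def tag_historical(tokens):
    """Translate first, tag the modern forms, then attach tags to the original tokens."""
    modern = translate(tokens)
    labels = [label for _, label in tag(modern)]
    return list(zip(tokens, labels))


if __name__ == "__main__":
    sentence = ["ick", "zie", "hy"]
    print(tag(sentence))             # tagging the historical forms directly
    print(tag_historical(sentence))  # tagging after the translation step
```

In this toy setup, the direct tagger fails on spellings it has never seen, while the translated input maps onto forms it knows. A larger lexicon covers more historical spellings, which mirrors the abstract's remark that extending the lexicons leaves room for further improvement.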

Cite

Sang, E. T. K. (2016). Improving part-of-speech tagging of historical text by first translating to modern text. In IFIP Advances in Information and Communication Technology (Vol. 482, pp. 54–64). Springer New York LLC. https://doi.org/10.1007/978-3-319-46224-0_6
