Using low-cost annotation to train a reliable Czech shallow parser

3Citations
Citations of this article
2Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Bushbank is a relatively new concept - a type of annotated corpus where annotation is driven by use of automatic tools and the task of human annotators is limited to accepting or rejecting parts of their output. This creates a possibility to obtain annotated corpora of considerable size at relatively low cost. In this paper we ask the question if the Czech Bushbank is reliable enough to be used for a NLP task instead of a traditional corpus with high annotation rigour. We perform evaluation of three different parsers using its shallow syntactic annotation, including a CRF chunker made originally for Polish. The results are very promising, showing that many practical applications could benefit from low-cost annotation. © 2013 Springer-Verlag.

Cite

CITATION STYLE

APA

Radziszewski, A., & Grác, M. (2013). Using low-cost annotation to train a reliable Czech shallow parser. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 8082 LNAI, pp. 575–582). https://doi.org/10.1007/978-3-642-40585-3_72

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free