Reducing human effort in named entity corpus construction based on ensemble learning and annotation categorization

0Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Annotated named entity corpora play a significant role in many natural language processing applications. However, annotation by humans is time-consuming and costly. In this paper, we propose a high recall pre-annotator which combines multiple existing named entity taggers based on ensemble learning, to reduce the number of annotations that humans have to add. In addition, annotations are categorized into normal annotations and candidate annotations based on their estimated confidence, to reduce the number of human corrective actions as well as the total annotation time. The experiment results show that our approach outperforms the baseline methods in reduction of annotation time without loss in annotation performance (in terms of F-measure).

Cite

CITATION STYLE

APA

Lu, T., Zhu, M., & Gao, Z. (2016). Reducing human effort in named entity corpus construction based on ensemble learning and annotation categorization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10102, pp. 263–274). Springer Verlag. https://doi.org/10.1007/978-3-319-50496-4_22

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free