Reducing human effort in named entity corpus construction based on ensemble learning and annotation categorization

Tingming Lu; Man Zhu; Zhiqiang Gao

Book Chapter

Reducing human effort in named entity corpus construction based on ensemble learning and annotation categorization

Springer Verlag, (2016), 263-274

DOI: 10.1007/978-3-319-50496-4_22

0Citations

6Readers

Get full text

Abstract

Annotated named entity corpora play a significant role in many natural language processing applications. However, annotation by humans is time-consuming and costly. In this paper, we propose a high recall pre-annotator which combines multiple existing named entity taggers based on ensemble learning, to reduce the number of annotations that humans have to add. In addition, annotations are categorized into normal annotations and candidate annotations based on their estimated confidence, to reduce the number of human corrective actions as well as the total annotation time. The experiment results show that our approach outperforms the baseline methods in reduction of annotation time without loss in annotation performance (in terms of F-measure).

Author supplied keywords

Cite

CITATION STYLE

APA

Lu, T., Zhu, M., & Gao, Z. (2016). Reducing human effort in named entity corpus construction based on ensemble learning and annotation categorization. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10102, pp. 263–274). Springer Verlag. https://doi.org/10.1007/978-3-319-50496-4_22

Reducing human effort in named entity corpus construction based on ensemble learning and annotation categorization

Abstract

Author supplied keywords

Cite

Register to see more suggestions