Improving the performance of a named entity extractor by applying a stacking scheme

José A. Troyano; Víctor J. Díaz; Fernando Enríquez; Luisa Romero

Conference Proceedings

Improving the performance of a named entity extractor by applying a stacking scheme

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (2004) 3315 295-304

DOI: 10.1007/978-3-540-30498-2_30

0Citations

6Readers

Get full text

Abstract

In this paper we investigate the way of improving the performance of a Named Entity Extraction (NEE) system by applying machine learning techniques and corpus transformation. The main resources used in our experiments are the publicly available tagger TnT and a corpus of Spanish texts in which named entities occurrences are tagged with BIO tags. We split the NEE task into two subtasks 1) Named Entity Recognition (NER) that involves the identification of the group of words that make up the name of an entity and 2) Named Entity Classification (NEC) that determines the category of a named entity. We have focused our work on the improvement of the NER task, generating four different taggers with the same training corpus and combining them using a stacking scheme. We improve the baseline of the NER task (Fβ=1 value of 81.84) up to a value of 88.37. When a NEC module is added to the NER system the performance of the whole NEE task is also improved. A value of 70.47 is achieved from a baseline of 66.07. © Springer-Verlag Berlin Heidelberg 2004.

Cite

CITATION STYLE

APA

Troyano, J. A., Díaz, V. J., Enríquez, F., & Romero, L. (2004). Improving the performance of a named entity extractor by applying a stacking scheme. In Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science) (Vol. 3315, pp. 295–304). Springer Verlag. https://doi.org/10.1007/978-3-540-30498-2_30

Improving the performance of a named entity extractor by applying a stacking scheme

Abstract

Cite

Register to see more suggestions