Segment representations in named entity recognition

Michal Konkol; Miloslav Konopík

Conference Proceedings

Segment representations in named entity recognition

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9302 61-70

DOI: 10.1007/978-3-319-24033-6_7

22Citations

32Readers

Get full text

Abstract

In this paper we study the effects of various segment representations in the named entity recognition (NER) task. The segment representation is responsible for mapping multi-word entities into classes used in the chosen machine learning approach. Usually, the choice of a segment representation in the NER system is arbitrary without proper tests. Some authors presented comparisons of different segment representations such as BIO, BIEO, BILOU and usually compared only two segment representations. Our goal is to show, that the segment representation problem is more complex and that the proper selection of the best approach is not straightforward. We provide experiments with a wide set of segment representations. All the representations are tested using two popular machine learning algorithms: Conditional Random Fields and Maximum Entropy. Furthermore, the tests are done on four languages, namely English, Spanish, Dutch and Czech.

Cite

CITATION STYLE

APA

Konkol, M., & Konopík, M. (2015). Segment representations in named entity recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9302, pp. 61–70). Springer Verlag. https://doi.org/10.1007/978-3-319-24033-6_7

Segment representations in named entity recognition

Abstract

Cite

Register to see more suggestions