A Feature-based Ensemble Approach to Recognition of Emerging and Rare Named Entities

9Citations
Citations of this article
81Readers
Mendeley users who have this article in their library.

Abstract

Detecting previously unseen named entities in text is a challenging task. The paper describes how three initial classifier models were built using Conditional Random Fields (CRFs), Support Vector Machines (SVMs) and a Long Short-Term Memory (LSTM) recurrent neural network. The outputs of these three classifiers were then used as features to train another CRF classifier working as an ensemble. 5-fold cross-validation based on training and development data for the emerging and rare named entity recognition shared task showed precision, recall and F1- score of 66.87%, 46.75% and 54.97%, respectively. For surface form evaluation, the CRF ensemble-based system achieved precision, recall and F1 scores of 65.18%, 45.20% and 53.30%. When applied to unseen test data, the model reached 47.92% precision, 31.97% recall and 38.55% F1- score for entity level evaluation, with the corresponding surface form evaluation values of 44.91%, 30.47% and 36.31%.

Cite

CITATION STYLE

APA

Sikdar, U. K., & Gamback, B. (2017). A Feature-based Ensemble Approach to Recognition of Emerging and Rare Named Entities. In 3rd Workshop on Noisy User-Generated Text, W-NUT 2017 - Proceedings of the Workshop (pp. 177–181). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w17-4424

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free