USFD: Twitter NER with Drift Compensation and Linked Data

Leon Derczynski; Isabelle Augenstein; Kalina Bontcheva

Conference Proceedings, Open Access

ACL-IJCNLP 2015 - Workshop on Noisy User-Generated Text, WNUT 2015 - Proceedings of the Workshop (2015) 48-53

DOI: 10.18653/v1/w15-4306

Abstract

This paper describes a pilot NER system for Twitter, comprising the USFD system entry to the W-NUT 2015 NER shared task. The goal is to correctly label entities in a tweet dataset, using an inventory of ten types. We employ structured learning, drawing on gazetteers taken from Linked Data and on unsupervised clustering features, and attempt to compensate for stylistic and topic drift, a key challenge in social media text. Our result is competitive; we provide an analysis of the components of our methodology, and an examination of the target dataset in the context of this task.

Cite

Derczynski, L., Augenstein, I., & Bontcheva, K. (2015). USFD: Twitter NER with Drift Compensation and Linked Data. In ACL-IJCNLP 2015 - Workshop on Noisy User-Generated Text, WNUT 2015 - Proceedings of the Workshop (pp. 48–53). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/w15-4306
