Context and domain knowledge enhanced entity spotting in Informal text

Daniel Gruhl; Meena Nagarajan; Jan Pieper; Christine Robson; Amit Sheth

Conference ProceedingsOPEN ACCESS

Context and domain knowledge enhanced entity spotting in Informal text

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2009) 5823 LNCS 260-276

DOI: 10.1007/978-3-642-04930-9_17

30Citations

49Readers

Abstract

This paper explores the application of restricted relationship graphs (RDF) and statistical NLP techniques to improve named entity annotation in challenging Informal English domains. We validate our approach using on-line forums discussing popular music. Named entity annotation is particularly difficult in this domain because it is characterized by a large number of ambiguous entities, such as the Madonna album "Music" or Lilly Allen's pop hit "Smile". We evaluate improvements in annotation accuracy that can be obtained by restricting the set of possible entities using real-world constraints. We find that constrained domain entity extraction raises the annotation accuracy significantly, making an infeasible task practical. We then show that we can further improve annotation accuracy by over 50% by applying SVM based NLP systems trained on word-usages in this domain. © Springer-Verlag Berlin Heidelberg 2009.

Cite

CITATION STYLE

APA

Gruhl, D., Nagarajan, M., Pieper, J., Robson, C., & Sheth, A. (2009). Context and domain knowledge enhanced entity spotting in Informal text. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 5823 LNCS, pp. 260–276). Springer Verlag. https://doi.org/10.1007/978-3-642-04930-9_17

Context and domain knowledge enhanced entity spotting in Informal text

Abstract

Cite

Register to see more suggestions