Detecting and disambiguating locations mentioned in twitter messages

20Citations
Citations of this article
31Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Detecting the location entities mentioned in Twitter messages is useful in text mining for business, marketing or defence applications. Therefore, techniques for extracting the location entities from the Twitter textual content are needed. In this work, we approach this task in a similar manner to the Named Entity Recognition (NER) task focused only on locations, but we address a deeper task: classifying the detected locations into names of cities, provinces/states, and countries. We approach the task in a novel way, consisting in two stages. In the first stage, we train Conditional Random Fields (CRF) models with various sets of features; we collected and annotated our own dataset or training and testing. In the second stage, we resolve cases when there exist more than one place with the same name. We propose a set of heuristics for choosing the correct physical location in these cases. We report good evaluation results for both tasks.

Cite

CITATION STYLE

APA

Inkpen, D., Liu, J., Farzindar, A., Kazemi, F., & Ghazi, D. (2015). Detecting and disambiguating locations mentioned in twitter messages. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9042, pp. 321–332). Springer Verlag. https://doi.org/10.1007/978-3-319-18117-2_24

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free