Detecting and disambiguating locations mentioned in twitter messages

Diana Inkpen; Ji Liu; Atefeh Farzindar; Farzaneh Kazemi; Diman Ghazi

Conference Proceedings

Detecting and disambiguating locations mentioned in twitter messages

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9042 321-332

DOI: 10.1007/978-3-319-18117-2_24

20Citations

31Readers

Get full text

Abstract

Detecting the location entities mentioned in Twitter messages is useful in text mining for business, marketing or defence applications. Therefore, techniques for extracting the location entities from the Twitter textual content are needed. In this work, we approach this task in a similar manner to the Named Entity Recognition (NER) task focused only on locations, but we address a deeper task: classifying the detected locations into names of cities, provinces/states, and countries. We approach the task in a novel way, consisting in two stages. In the first stage, we train Conditional Random Fields (CRF) models with various sets of features; we collected and annotated our own dataset or training and testing. In the second stage, we resolve cases when there exist more than one place with the same name. We propose a set of heuristics for choosing the correct physical location in these cases. We report good evaluation results for both tasks.

Cite

CITATION STYLE

APA

Inkpen, D., Liu, J., Farzindar, A., Kazemi, F., & Ghazi, D. (2015). Detecting and disambiguating locations mentioned in twitter messages. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9042, pp. 321–332). Springer Verlag. https://doi.org/10.1007/978-3-319-18117-2_24

Detecting and disambiguating locations mentioned in twitter messages

Abstract

Cite

Register to see more suggestions