Kernel density estimation for text-based geolocation

Mans Hulden; Miikka Silfverberg; Jerid Francom

Conference ProceedingsOPEN ACCESS

Kernel density estimation for text-based geolocation

Proceedings of the National Conference on Artificial Intelligence (2015) 1 145-150

DOI: 10.1609/aaai.v29i1.9149

51Citations

40Readers

Abstract

Text-based geolocation classifiers often operate with a grid-based view of the world. Predicting document location of origin based on text content on a geodesic grid is computationally attractive since many standard methods for supervised document classification carry over unchanged to geolocation in the form of predicting a most probable grid cell for a document. However, the grid-based approach suffers from sparse data problems if one wants to improve classification accuracy by moving to smaller cell sizes. In this paper we investigate an enhancement of common methods for determining the geographic point of origin of a text document by kernel density estimation. For geolocation of tweets we obtain a improvements upon non-kernel methods on datasets of U.S. and global Twitter content.

Cite

CITATION STYLE

APA

Hulden, M., Silfverberg, M., & Francom, J. (2015). Kernel density estimation for text-based geolocation. In Proceedings of the National Conference on Artificial Intelligence (Vol. 1, pp. 145–150). AI Access Foundation. https://doi.org/10.1609/aaai.v29i1.9149

Kernel density estimation for text-based geolocation

Abstract

Cite

Register to see more suggestions