Abstract
Text-based geolocation classifiers often operate with a grid-based view of the world. Predicting document location of origin based on text content on a geodesic grid is computationally attractive since many standard methods for supervised document classification carry over unchanged to geolocation in the form of predicting a most probable grid cell for a document. However, the grid-based approach suffers from sparse data problems if one wants to improve classification accuracy by moving to smaller cell sizes. In this paper we investigate an enhancement of common methods for determining the geographic point of origin of a text document by kernel density estimation. For geolocation of tweets we obtain a improvements upon non-kernel methods on datasets of U.S. and global Twitter content.
Cite
CITATION STYLE
Hulden, M., Silfverberg, M., & Francom, J. (2015). Kernel density estimation for text-based geolocation. In Proceedings of the National Conference on Artificial Intelligence (Vol. 1, pp. 145–150). AI Access Foundation. https://doi.org/10.1609/aaai.v29i1.9149
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.