Which portland is it?: A machine learning approach

3Citations
Citations of this article
6Readers
Mendeley users who have this article in their library.
Get full text

Abstract

This paper reviews several approaches to the problem of toponym resolution for news articles referring to 'Portland.' We train several models to differentiate between Portland, Maine and Portland, Oregon, generating features using only the text of the articles. The data used is in the form of articles pulled from NewsStand. The labels, which are provided by NewsStand's interpretation of the articles, allow for a supervised learning approach. We apply Natural Language Processing (NLP) and data cleaning techniques to process the article data, perform feature reduction, and then feed the data to the models. We show that the logistic regression model performs the best of the four models that we test. We also demonstrate that this model learns a more robust representation of the two classes than the other three models do.

References Powered by Scopus

Bag-of-visual-words and spatial extensions for land-use classification

2512Citations
N/AReaders
Get full text

Modelling vague places with knowledge from the Web

174Citations
N/AReaders
Get full text

NewsStand: A new view on news

132Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Cross-lingual Text Clustering in a Large System

1Citations
N/AReaders
Get full text

Toponym Extraction and Disambiguation from Text: A Survey

0Citations
N/AReaders
Get full text

Cross-Lingual Clustering Using Large Language Models

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Schneider, N. R., & Samet, H. (2021). Which portland is it?: A machine learning approach. In Proceedings of the 5th ACM SIGSPATIAL International Workshop on Location-Based Recommendations, Geosocial Networks and Geoadvertising, LocalRec 2021 (pp. 49–58). Association for Computing Machinery, Inc. https://doi.org/10.1145/3486183.3491066

Readers' Seniority

Tooltip

Researcher 2

67%

Lecturer / Post doc 1

33%

Readers' Discipline

Tooltip

Computer Science 2

67%

Linguistics 1

33%

Save time finding and organizing research with Mendeley

Sign up for free