Geocoding freeform placenames: An example of deciphering the czech national immigration database

1Citations
Citations of this article
10Readers
Mendeley users who have this article in their library.

Abstract

The growth of international migration and its societal and political impacts bring a greater need for accurate data to measure, understand and control migration flows. However, in the Czech immigration database, the birthplaces of immigrants are only kept in freeform text fields, a substantial obstacle to their further processing due to numerous errors in transcription and spelling. This study overcomes this obstacle by deploying a custom geocoding engine based on GeoNames, tailored transcription rules and fuzzy matching in order to achieve good accuracy even for noisy data while not depending on third-party services, resulting in lower costs than the comparable approaches. The results are presented on a subnational level for the immigrants coming to Czechia from the USA, Ukraine, Moldova and Vietnam, revealing important spatial patterns that are invisible on the national level.

Cite

CITATION STYLE

APA

Šimbera, J., Drbohlav, D., & Štych, P. (2021). Geocoding freeform placenames: An example of deciphering the czech national immigration database. ISPRS International Journal of Geo-Information, 10(5). https://doi.org/10.3390/ijgi10050335

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free