For the analysis of historical wage development, no structured data is available. Job advertisements, as found in newspapers can provide insights into what different types of jobs paid, but require language technology to structure in a format conducive to quantitative analysis. In this paper, we report on our experiments to mine wages from 19th century newspaper advertisements and detail the challenges that need to be overcome to perform a socioeconomic analysis of textual data sources.
Ros, R., Van Erp, M., Rijpma, A., & Zijdeman, R. (2020). Mining Wages in Nineteenth-Century Job Advertisements. The Application of Language Resources and Language Technology to study Economic and Social Inequality. In Proceedings of LR4SSHOC: Workshop about Language Resources for the SSH Cloud. Language Resources and Evaluation Conference (LREC 2020), Marseille, 11–16 May 2020 (pp. 27–32). European Language Resources Association. Retrieved from https://www.aclweb.org/anthology/2020.lr4sshoc-1.5.pdf