Abstract
Innovation is one of the major drivers of economic growth, where spatial processes of knowledge spillover play a vital role. Current practices in assessing firms' innovation activity, including patent analysis and questionnaires, suffer from severe limitations. In this paper, we propose a novel approach to estimate firms' innovation activity based on the texts on their websites. We use an automated web-scraper to harvest text from the websites, then extract semantic topics in a self-learning, generative topic-modelling approach, and finally analyse these topics using an Artificial Neural Networks (ANN) method to assess each firm's level of innovation. This procedure results in a large-scale dataset that will be used for further spatial economic analysis of the distribution of innovative firms and the processes that drive the development of innovation in firms.
Author supplied keywords
Cite
CITATION STYLE
Kinne, J., & Resch, B. (2018). Generating big spatial data on firm innovation activity from text-mined firm websites. GI_Forum, 6(1), 82–89. https://doi.org/10.1553/GISCIENCE2018_01_S82
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.