Generating big spatial data on firm innovation activity from text-mined firm websites

9Citations
Citations of this article
34Readers
Mendeley users who have this article in their library.

Abstract

Innovation is one of the major drivers of economic growth, where spatial processes of knowledge spillover play a vital role. Current practices in assessing firms' innovation activity, including patent analysis and questionnaires, suffer from severe limitations. In this paper, we propose a novel approach to estimate firms' innovation activity based on the texts on their websites. We use an automated web-scraper to harvest text from the websites, then extract semantic topics in a self-learning, generative topic-modelling approach, and finally analyse these topics using an Artificial Neural Networks (ANN) method to assess each firm's level of innovation. This procedure results in a large-scale dataset that will be used for further spatial economic analysis of the distribution of innovative firms and the processes that drive the development of innovation in firms.

Cite

CITATION STYLE

APA

Kinne, J., & Resch, B. (2018). Generating big spatial data on firm innovation activity from text-mined firm websites. GI_Forum, 6(1), 82–89. https://doi.org/10.1553/GISCIENCE2018_01_S82

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free