The language of innovation

11Citations
Citations of this article
47Readers
Mendeley users who have this article in their library.

Abstract

Predicting innovation is a peculiar problem in data science. Following its definition, an innovation is always a never-seen-before event, leaving no room for traditional supervised learning approaches. Here we propose a strategy to address the problem in the context of innovative patents, by defining innovations as never-seen-before associations of technologies and exploiting self-supervised learning techniques. We think of technological codes present in patents as a vocabulary and the whole technological corpus as written in a specific, evolving language. We leverage such structure with techniques borrowed from Natural Language Processing by embedding technologies in a high dimensional euclidean space where relative positions are representative of learned semantics. Proximity in this space is an effective predictor of specific innovation events, that outperforms a wide range of standard link-prediction metrics. The success of patented innovations follows a complex dynamics characterized by different patterns which we analyze in details with specific examples. The methods proposed in this paper provide a completely new way of understanding and forecasting innovation, by tackling it from a revealing perspective and opening interesting scenarios for a number of applications and further analytic approaches.

References Powered by Scopus

The meaning and use of the area under a receiver operating characteristic (ROC) curve

17829Citations
N/AReaders
Get full text

An introduction to ROC analysis

16069Citations
N/AReaders
Get full text

Modularity and community structure in networks

9168Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Adaptation of innovations in the it industry in poland: The impact of selected internal communication factors

20Citations
N/AReaders
Get full text

Innovation indicators based on firm websites — Which website characteristics predict firm-level innovation activity?

14Citations
N/AReaders
Get full text

Relatedness in the era of machine learning

7Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Tacchella, A., Napoletano, A., & Pietronero, L. (2020). The language of innovation. PLoS ONE, 15(4). https://doi.org/10.1371/journal.pone.0230107

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 17

59%

Lecturer / Post doc 5

17%

Researcher 4

14%

Professor / Associate Prof. 3

10%

Readers' Discipline

Tooltip

Business, Management and Accounting 7

35%

Computer Science 6

30%

Physics and Astronomy 4

20%

Engineering 3

15%

Save time finding and organizing research with Mendeley

Sign up for free