The language of innovation

Andrea Tacchella; Andrea Napoletano; Luciano Pietronero

Journal ArticleOPEN ACCESS

The language of innovation

PLoS ONE (2020) 15(4)

DOI: 10.1371/journal.pone.0230107

11Citations

47Readers

Abstract

Predicting innovation is a peculiar problem in data science. Following its definition, an innovation is always a never-seen-before event, leaving no room for traditional supervised learning approaches. Here we propose a strategy to address the problem in the context of innovative patents, by defining innovations as never-seen-before associations of technologies and exploiting self-supervised learning techniques. We think of technological codes present in patents as a vocabulary and the whole technological corpus as written in a specific, evolving language. We leverage such structure with techniques borrowed from Natural Language Processing by embedding technologies in a high dimensional euclidean space where relative positions are representative of learned semantics. Proximity in this space is an effective predictor of specific innovation events, that outperforms a wide range of standard link-prediction metrics. The success of patented innovations follows a complex dynamics characterized by different patterns which we analyze in details with specific examples. The methods proposed in this paper provide a completely new way of understanding and forecasting innovation, by tackling it from a revealing perspective and opening interesting scenarios for a number of applications and further analytic approaches.

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Tacchella, A., Napoletano, A., & Pietronero, L. (2020). The language of innovation. PLoS ONE, 15(4). https://doi.org/10.1371/journal.pone.0230107

Readers' Seniority

PhD / Post grad / Masters / Doc 17

59%

Lecturer / Post doc 5

17%

Researcher 4

14%

Professor / Associate Prof. 3

10%

Readers' Discipline

Business, Management and Accounting 7

35%

Computer Science 6

30%

Physics and Astronomy 4

20%

Engineering 3

15%

The language of innovation

Abstract

References Powered by Scopus

The meaning and use of the area under a receiver operating characteristic (ROC) curve

An introduction to ROC analysis

Modularity and community structure in networks

Cited by Powered by Scopus

Adaptation of innovations in the it industry in poland: The impact of selected internal communication factors

Innovation indicators based on firm websites — Which website characteristics predict firm-level innovation activity?

Relatedness in the era of machine learning

Register to see more suggestions

Cite

Readers' Seniority

Readers' Discipline