A Dataset on Linguistic Connectivity Across and Within Countries

2Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

We construct a new global dataset on common language. The data cover 242 countries and territories and are based on information about the speakers of 6,675 languages. Using data from Ethnologue, we provide 11 bilateral measures reflecting different dimensions of linguistic connections within and between countries, including common official languages, common native and acquired languages, and linguistic proximity across different languages. A key novelty of the dataset is that it includes consistently defined information on linguistic relationships not only between different countries but within the administrative borders of countries as well.

Cite

CITATION STYLE

APA

Gurevich, T., Herman, P. R., Toubal, F., & Yotov, Y. V. (2025). A Dataset on Linguistic Connectivity Across and Within Countries. Scientific Data , 12(1). https://doi.org/10.1038/s41597-025-04692-8

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free