Abstract
We construct a new global dataset on common language. The data cover 242 countries and territories and are based on information about the speakers of 6,675 languages. Using data from Ethnologue, we provide 11 bilateral measures reflecting different dimensions of linguistic connections within and between countries, including common official languages, common native and acquired languages, and linguistic proximity across different languages. A key novelty of the dataset is that it includes consistently defined information on linguistic relationships not only between different countries but within the administrative borders of countries as well.
Cite
CITATION STYLE
Gurevich, T., Herman, P. R., Toubal, F., & Yotov, Y. V. (2025). A Dataset on Linguistic Connectivity Across and Within Countries. Scientific Data , 12(1). https://doi.org/10.1038/s41597-025-04692-8
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.