Improving Domain Repository Connectivity

Ted Habermann

Journal ArticleOPEN ACCESS

Improving Domain Repository Connectivity

Habermann T

Data Intelligence (2023) 5(1) 6-26

DOI: 10.1162/dint_a_00120

4Citations

6Readers

Abstract

Domain repositories, i.e. repositories that store, manage, and persist data pertaining to a specific scientific domain, are common and growing in the research landscape. Many of these repositories develop close, long-term communities made up of individuals and organizations that collect, analyze, and publish results based on the data in the repositories. Connections between these datasets, papers, people, and organizations are an important part of the knowledge infrastructure surrounding the repository. All these research objects, people, and organizations can now be identified using various unique and persistent identifiers (PIDs) and it is possible for domain repositories to build on their existing communities to facilitate and accelerate the identifier adoption process. As community members contribute to multiple datasets and articles, identifiers for them, once found, can be used multiple times. We explore this idea by defining a connectivity metric and applying it to datasets collected and papers published by members of the UNAVCO community. Finding identifiers in DataCite and Crossref metadata and spreading those identifiers through the UNAVCO DataCite metadata can increase connectivity from less than 10% to close to 50% for people and organizations.

Author supplied keywords

Cite

CITATION STYLE

APA

Habermann, T. (2023). Improving Domain Repository Connectivity. Data Intelligence, 5(1), 6–26. https://doi.org/10.1162/dint_a_00120

Improving Domain Repository Connectivity

Abstract

Author supplied keywords

Cite

Register to see more suggestions