Automatic curation of clinical trials data in LinkedCT

Oktie Hassanzadeh; Renée J. Miller

Conference ProceedingsOPEN ACCESS

Automatic curation of clinical trials data in LinkedCT

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2015) 9367 270-278

DOI: 10.1007/978-3-319-25010-6_16

8Citations

18Readers

Abstract

The Linked Clinical Trials (LinkedCT) project started back in 2008 with the goal of providing a Linked Data source of clinical trials. The source of the data is from the XML data published on ClinicalTrials. gov, which is an international registry of clinical studies. Since the initial release, the LinkedCT project has gone through some major changes to both improve the quality of the data and its freshness. The result is a high-quality Linked Data source of clinical studies that is updated daily, currently containing over 195,000 trials, 4.6 million entities, and 42 million triples. In this paper, we present a detailed description of the system along with a brief outline of technical challenges involved in curating the raw XML data into high-quality Linked Data. We also present usage statistics and a number of interesting use cases developed by external parties. We share the lessons learned in the design and implementation of the current system, along with an outline of our future plans for the project which include making the system open-source and making the data free for commercial use.

Author supplied keywords

Cite

CITATION STYLE

APA

Hassanzadeh, O., & Miller, R. J. (2015). Automatic curation of clinical trials data in LinkedCT. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9367, pp. 270–278). Springer Verlag. https://doi.org/10.1007/978-3-319-25010-6_16

Automatic curation of clinical trials data in LinkedCT

Abstract

Author supplied keywords

Cite

Register to see more suggestions