Towards the representation of genomic data in HL7 FHIR and OMOP CDM

8Citations
Citations of this article
35Readers
Mendeley users who have this article in their library.

Abstract

High throughput sequencing technologies have facilitated an outburst in biological knowledge over the past decades and thus enables improvements in personalized medicine. In order to support (international) medical research with the combination of genomic and clinical patient data, a standardization and harmonization of these data sources is highly desirable. To support this increasing importance of genomic data, we have created semantic mapping from raw genomic data to both FHIR (Fast Healthcare Interoperability Resources) and OMOP (Observational Medical Outcomes Partnership) CDM (Common Data Model) and analyzed the data coverage of both models. For this, we calculated the mapping score for different data categories and the relative data coverage in both FHIR and OMOP CDM. Our results show, that the patients genomic data can be mapped to OMOP CDM directly from VCF (Variant Call Format) file with a coverage of slightly over 50%. However, using FHIR as intermediate representation does not lead to further information loss as the already stored data in FHIR can be further transformed into OMOP CDM format with almost 100% success. Our findings are in favor of extending OMOP CDM with patient genomic data using ETL to enable the researchers to apply different analysis methods including machine learning algorithms on genomic data.

Author supplied keywords

Cite

CITATION STYLE

APA

Peng, Y., Nassirian, A., Ahmadi, N., Sedlmayr, M., & Bathelt, F. (2021). Towards the representation of genomic data in HL7 FHIR and OMOP CDM. In Studies in Health Technology and Informatics (Vol. 283, pp. 86–94). IOS Press BV. https://doi.org/10.3233/SHTI210545

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free