Technical challenges for big data in biomedicine and health: data sources, infrastructure, and analytics

68Citations
Citations of this article
215Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

OBJECTIVES: To review technical and methodological challenges for big data research in biomedicine and health. METHODS: We discuss sources of big datasets, survey infrastructures for big data storage and big data processing, and describe the main challenges that arise when analyzing big data. RESULTS: The life and biomedical sciences are massively contributing to the big data revolution through secondary use of data that were collected during routine care and through new data sources such as social media. Efficient processing of big datasets is typically achieved by distributing computation over a cluster of computers. Data analysts should be aware of pitfalls related to big data such as bias in routine care data and the risk of false-positive findings in high-dimensional datasets. CONCLUSIONS: The major challenge for the near future is to transform analytical methods that are used in the biomedical and health domain, to fit the distributed storage and processing model that is required to handle big data, while ensuring confidentiality of the data being analyzed.

Cite

CITATION STYLE

APA

Peek, N., Holmes, J. H., & Sun, J. (2014, August 15). Technical challenges for big data in biomedicine and health: data sources, infrastructure, and analytics. Yearbook of Medical Informatics. https://doi.org/10.15265/IY-2014-0018

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free