Privacy-preserving data mining of medical data using data separation-based techniques

Kou Gang; Peng Yi; Shi Yong; Chen Zhengxin

Journal ArticleOPEN ACCESS

Privacy-preserving data mining of medical data using data separation-based techniques

Data Science Journal (2007) 6(SUPPL.)

DOI: 10.2481/dsj.6.S429

18Citations

8Readers

Abstract

Data mining is concerned with the extraction of useful knowledge from various types of data. Medical data mining has been a popular data mining topic of late. Compared with other data mining areas, medical data mining has some unique characteristics. Because medical files are related to human subjects, privacy concerns are taken more seriously than other data mining tasks. This paper applied data separation-based techniques to preserve privacy in classification of medical data. We take two approaches to protect privacy: one approach is to vertically partition the medical data and mine these partitioned data at multiple sites; the other approach is to horizontally split data across multiple sites. In the vertical partition approach, each site uses a portion of the attributes to compute its results, and the distributed results are assembled at a central trusted party using a majority-vote ensemble method. In the horizontal partition approach, data are distributed among several sites. Each site computes its own data, and a central trusted party is responsible to integrate these results. We implement these two approaches using medical datasets from UCI KDD archive and report the experimental results.

Author supplied keywords

Cite

CITATION STYLE

APA

Gang, K., Yi, P., Yong, S., & Zhengxin, C. (2007). Privacy-preserving data mining of medical data using data separation-based techniques. Data Science Journal, 6(SUPPL.). https://doi.org/10.2481/dsj.6.S429

Privacy-preserving data mining of medical data using data separation-based techniques

Abstract

Author supplied keywords

Cite

Register to see more suggestions