Model Selection for Data Analysis in Encrypted Domain: Application to Simple Linear Regression

Mi Yeon Hong; Ji Won Yoon

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2020) 11897 LNCS 155-166

DOI: 10.1007/978-3-030-39303-8_12

Abstract

In the big data era, data scientists apply machine learning methods to observed data for prediction or classification. To be effective, machine learning requires access to raw data, which is often privacy sensitive. In addition, whatever data and fitting procedures are employed, a crucial step is to select the most appropriate model for the given dataset. Model selection is a key ingredient in data analysis for reliable and reproducible statistical inference or prediction. To address these issues, we develop new techniques for running model selection over encrypted data. Our approach finds the best approximation of the relationship between the dependent and independent variables through cross validation. Performing 4-fold cross validation yields 4 different estimates of the model's error. We then use the bias and variance extracted from these errors to find the best model. We perform an experiment on a dataset extracted from Kaggle and show that our approach can perform regression homomorphically on encrypted data without decrypting it.
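The cross-validation step described in the abstract can be sketched as follows. This is a minimal plaintext illustration, not the authors' method: it does no encryption, and the function names (`fit_simple_linear`, `k_fold_mse`, `summarize`), the interleaved fold assignment, the use of mean squared error, and the reading of "bias" and "variance" as the mean and variance of the four fold errors are all assumptions made for the example.

```python
from statistics import mean, pvariance


def fit_simple_linear(xs, ys):
    """Closed-form least squares fit of y = a + b * x; returns (a, b)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    a = my - b * mx
    return a, b


def k_fold_mse(xs, ys, k=4):
    """Return k held-out mean squared errors, one per fold."""
    n = len(xs)
    errors = []
    for fold in range(k):
        # Assumed fold layout: every k-th point, starting at `fold`, is held out.
        held_out = set(range(fold, n, k))
        train_x = [xs[i] for i in range(n) if i not in held_out]
        train_y = [ys[i] for i in range(n) if i not in held_out]
        a, b = fit_simple_linear(train_x, train_y)
        squared = [(ys[i] - (a + b * xs[i])) ** 2 for i in sorted(held_out)]
        errors.append(mean(squared))
    return errors


def summarize(errors):
    """Assumed summary: mean and population variance of the fold errors."""
    return mean(errors), pvariance(errors)


if __name__ == "__main__":
    # Hypothetical toy data: roughly y = 2x + 1 with small deviations.
    xs = [float(i) for i in range(12)]
    ys = [2.0 * x + 1.0 + (0.3 if i % 3 == 0 else -0.2)
          for i, x in enumerate(xs)]
    fold_errors = k_fold_mse(xs, ys, k=4)
    print(fold_errors, summarize(fold_errors))
```

Under these assumptions, competing candidate models would each be passed through `k_fold_mse`, and the one with the most favourable error summary would be kept. Carrying out the same arithmetic over ciphertexts is the paper's contribution and is not attempted here.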

Cite

Hong, M. Y., & Yoon, J. W. (2020). Model Selection for Data Analysis in Encrypted Domain: Application to Simple Linear Regression. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11897 LNCS, pp. 155–166). Springer. https://doi.org/10.1007/978-3-030-39303-8_12
