Privacy-Preserving Synthetic Educational Data Generation

Jill Jênn Vie; Tomas Rigaux; Sein Minn

Conference Proceedings

Privacy-Preserving Synthetic Educational Data Generation

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2022) 13450 LNCS 393-406

DOI: 10.1007/978-3-031-16290-9_29

1Citations

8Readers

Get full text

Abstract

Institutions collect massive learning traces but they may not disclose it for privacy issues. Synthetic data generation opens new opportunities for research in education. In this paper we present a generative model for educational data that can preserve the privacy of participants, and an evaluation framework for comparing synthetic data generators. We show how naive pseudonymization can lead to re-identification threats and suggest techniques to guarantee privacy. We evaluate our method on existing massive educational open datasets.

Author supplied keywords

Cite

CITATION STYLE

APA

Vie, J. J., Rigaux, T., & Minn, S. (2022). Privacy-Preserving Synthetic Educational Data Generation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 13450 LNCS, pp. 393–406). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-16290-9_29

Privacy-Preserving Synthetic Educational Data Generation

Abstract

Author supplied keywords

Cite

Register to see more suggestions