Privacy-Preserving Synthetic Educational Data Generation

Jill-Jênn Vie; Tomas Rigaux; Sein Minn

Communication Dans Un Congrès Année : 2022

Privacy-Preserving Synthetic Educational Data Generation

(1) , (1) , (2)

1
2

Jill-Jênn Vie

Fonction : Auteur
PersonId : 9988
IdHAL : jill-jenn-vie
ORCID : 0000-0002-9304-2220
IdRef : 192337890

Méthodes computationnelles et mathématiques pour comprendre la société et la santé à partir de données

Tomas Rigaux

Fonction : Auteur
PersonId : 1147309

Méthodes computationnelles et mathématiques pour comprendre la société et la santé à partir de données

Sein Minn

Fonction : Auteur
PersonId : 1147310

Rich Data Analytics at Cloud Scale

Résumé

Institutions collect massive learning traces but they may not disclose it for privacy issues. Synthetic data generation opens new opportunities for research in education. In this paper we present a generative model for educational data that can preserve the privacy of participants, and an evaluation framework for comparing synthetic data generators. We show how naive pseudonymization can lead to re-identification threats and suggest techniques to guarantee privacy. We evaluate our method on existing massive educational open datasets.

Mots clés

Privacy Item response theory Generative models

Domaines

Intelligence artificielle [cs.AI] Environnements Informatiques pour l'Apprentissage Humain Cryptographie et sécurité [cs.CR]

Fichier principal

EC_TEL_2022_paper_83_Vie.pdf (480.09 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Jill-Jênn Vie : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-03715416

Soumis le : mercredi 6 juillet 2022-13:41:10

Dernière modification le : lundi 29 janvier 2024-14:51:40

Dates et versions

hal-03715416 , version 1 (06-07-2022)

Identifiants

HAL Id : hal-03715416 , version 1
ARXIV : 2207.03202

Citer

Jill-Jênn Vie, Tomas Rigaux, Sein Minn. Privacy-Preserving Synthetic Educational Data Generation. EC-TEL 2022 - 17th European Conference on Technology Enhanced Learning, Sep 2022, Toulouse, France. ⟨hal-03715416⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

X TICE CNRS INRIA LIX X-LIX X-DEP-INFO INRIA2 TEL IP_PARIS GS-COMPUTER-SCIENCE

243 Consultations

152 Téléchargements

Privacy-Preserving Synthetic Educational Data Generation

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager