Data Augmentation for Enlarging Student Feature Space and Improving Random Forest Success Prediction

One of the main problems encountered when predicting student success, as a tool to aid students, is the lack of data used to model each student. This lack of data is due in part to the small number of students in each university course and also, the limited number of features that describe the educational background for each student. In this article, we introduce new features by augmenting the student feature space to obtain an improved model. These features are divided into several groups, namely, external added data, metric and counter data, and evolutive data. We will then assess the quality of the augmented data to classify at-risk students in their first year of university. For this article, the classifiers are built using Random Forests. As this learning method measures variable importance, we can enquire on the relevance of the augmented data, as well as the data groups that allow a more significant collection of features.

Mots clés

Student Success Random Forest Data Augmentation Educational Data Mining Student Metrics

Domaines

Informatique [cs]

Fichier principal

FinalPaper.pdf (495.98 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Christel Dartigues-Pallez : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03624792

Soumis le : mercredi 30 mars 2022-14:50:07

Dernière modification le : lundi 26 février 2024-11:22:13

Dates et versions

hal-03624792 , version 1 (30-03-2022)

Identifiants

HAL Id : hal-03624792 , version 1
DOI : 10.1007/978-3-030-78270-2_14

Citer

Timothy Bell, Christel Dartigues-Pallez, Florent Jaillet, Christophe Genolini. Data Augmentation for Enlarging Student Feature Space and Improving Random Forest Success Prediction. Artificial Intelligence in Education. AIED 2021., Jun 2021, Utrecht, Netherlands. pp.82 - 87, ⟨10.1007/978-3-030-78270-2_14⟩. ⟨hal-03624792⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS I3S UNIV-COTEDAZUR

41 Consultations

89 Téléchargements