Boosting Diversity in Regression Ensembles - Archive ouverte HAL Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2020

Boosting Diversity in Regression Ensembles

Mathias Bourel
  • Fonction : Auteur
Yannig Goude
  • Fonction : Auteur
Jean-Michel Poggi
  • Fonction : Auteur

Résumé

The practical interest of using ensemble methods has been highlighted in several works. Aggregating predictors leads very often to improve the performance of a single one. A fruitful recipe is to generate several predictors from a single one by perturbing the learning set and, instead of selecting the best one, to aggregate them. Bagging, boosting and Random forests are examples of such strategies useful both for classification and regression problems. A key ingredient to properly analyse the improvement of prediction performance is the diversity of the predictors ensemble. In the regression case, aggregation is mainly interested on how to generate individual predictors to improve quadratic prediction performance. We look for enhancing these methods by using the concept of diversity (also known as negative correlation learning). We propose an algorithm to enrich the set of original individual predictors using a gradient boosting-based method by incorporating a diversity term to guide the gradient boosting iterations. The idea is to progressively generate predictors by boosting diversity, this modification induces some kind of suboptimality of the individual learners but improve the ensemble. Then, we establish a convergence result ensuring that the associated optimisation strategy converges to a global optimum. Finally, we show by means of numerical experiments the appropriateness of our procedure and examine not only the final predictor or the aggregated one but also the generated sequence. First, on a simulated dataset, we illustrate and study the method with respect to the family of predictors as well the parameters to be tuned (diversity weight and gradient step). Second, real-world electricity demand datasets are considered opening the application of such ideas to the forecasting context.
Fichier principal
Vignette du fichier
main_modif-jc.pdf (506.66 Ko) Télécharger le fichier
Boosting Diversity.zip (468.88 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03041309 , version 1 (04-12-2020)

Identifiants

  • HAL Id : hal-03041309 , version 1

Citer

Mathias Bourel, Jairo Cugliari, Yannig Goude, Jean-Michel Poggi. Boosting Diversity in Regression Ensembles. 2020. ⟨hal-03041309⟩
275 Consultations
242 Téléchargements

Partager

Gmail Facebook X LinkedIn More