Risk upper bounds for general ensemble methods with an application to multiclass classification

François Laviolette; Emilie Morvant; Liva Ralaivola; Jean-Francis Roy

doi:10.1016/j.neucom.2016.09.016

Article Dans Une Revue Neurocomputing Année : 2017

Risk upper bounds for general ensemble methods with an application to multiclass classification

(1) , (2) , (3, 4) , (1)

1
2
3
4

François Laviolette

Fonction : Auteur
PersonId : 946278

Department of Computer Science and Software Engineering [Québec]

Emilie Morvant

Fonction : Auteur
PersonId : 410
IdHAL : emilie-morvant
ORCID : 0000-0002-8301-7240
IdRef : 179027468

Lampert Group

Liva Ralaivola

Fonction : Auteur
PersonId : 5004
IdHAL : livaralaivola
ORCID : 0000-0002-4571-1119
IdRef : 089319060

éQuipe d'AppRentissage de MArseille

Institut universitaire de France

Jean-Francis Roy

Fonction : Auteur
PersonId : 959079

Department of Computer Science and Software Engineering [Québec]

Résumé

This paper generalizes a pivotal result from the PAC-Bayesian literature —the −bound— primarily designed for binary classification to the general case of ensemble methods of voters with arbitrary outputs. We provide a generic version of the −bound, an upper bound over the risk of models expressed as a weighted majority vote that is based on the first and second statistical moments of the vote's margin. On the one hand, this bound may advantageously be applied on more complex outputs than mere binary outputs, such as multiclass labels and multilabel, and on the other hand, it allows us to consider margin relaxations. We provide a specialization of the bound to multiclass classification together with empirical evidence that the presented theoretical result is tightly bound to the risk of the majority vote classifier. We also give insights as to how the proposed bound may be of use to characterize the risk of multilabel predictors.

Mots clés

Majority vote Ensemble methods PAC-Bayesian Theory Multiclass classification Multilabel Prediction

Domaines

Apprentissage [cs.LG] Machine Learning [stat.ML]

Fichier principal

cbound_multi.pdf (544.12 Ko)

Liva Ralaivola : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01774837

Soumis le : lundi 4 juin 2018-14:08:35

Dernière modification le : lundi 15 avril 2024-11:25:23

Archivage à long terme le : mercredi 5 septembre 2018-12:21:52

Dates et versions

hal-01774837 , version 1 (04-06-2018)

Identifiants

HAL Id : hal-01774837 , version 1
DOI : 10.1016/j.neucom.2016.09.016

Citer

François Laviolette, Emilie Morvant, Liva Ralaivola, Jean-Francis Roy. Risk upper bounds for general ensemble methods with an application to multiclass classification. Neurocomputing, 2017, 219, pp.15 - 25. ⟨10.1016/j.neucom.2016.09.016⟩. ⟨hal-01774837⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-TLN CNRS UNIV-AMU LIS-LAB ANR

197 Consultations

521 Téléchargements

Risk upper bounds for general ensemble methods with an application to multiclass classification

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager