Risk upper bounds for general ensemble methods with an application to multiclass classification

Abstract : This paper generalizes a pivotal result from the PAC-Bayesian literature —the −bound— primarily designed for binary classification to the general case of ensemble methods of voters with arbitrary outputs. We provide a generic version of the −bound, an upper bound over the risk of models expressed as a weighted majority vote that is based on the first and second statistical moments of the vote's margin. On the one hand, this bound may advantageously be applied on more complex outputs than mere binary outputs, such as multiclass labels and multilabel, and on the other hand, it allows us to consider margin relaxations. We provide a specialization of the bound to multiclass classification together with empirical evidence that the presented theoretical result is tightly bound to the risk of the majority vote classifier. We also give insights as to how the proposed bound may be of use to characterize the risk of multilabel predictors.
Complete list of metadatas

Cited literature [45 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01774837
Contributor : Liva Ralaivola <>
Submitted on : Monday, June 4, 2018 - 2:08:35 PM
Last modification on : Thursday, April 4, 2019 - 10:18:05 AM
Long-term archiving on : Wednesday, September 5, 2018 - 12:21:52 PM

Identifiers

Collections

Citation

François Laviolette, Emilie Morvant, Liva Ralaivola, Jean-Francis Roy. Risk upper bounds for general ensemble methods with an application to multiclass classification. Neurocomputing, Elsevier, 2017, 219, pp.15 - 25. ⟨10.1016/j.neucom.2016.09.016⟩. ⟨hal-01774837⟩

Share

Metrics

Record views

124

Files downloads

137