E. Allwein, R. Schapire, and Y. Singer, Reducing multiclass to binary: A unifying approach for margin classifiers, Journal of Machine Learning Research, vol.1, pp.113-141, 2001.

C. Blake and C. Merz, UCI Repository of machine learning databases, 1998.

G. Blanchard, Different Paradigms for Choosing Sequential Reweighting Algorithms, Neural Computation, vol.16, issue.4, pp.811-836, 2004.

B. Boser, I. Guyon, and V. Vapnik, A training algorithm for optimal margin classifiers, Proceedings of the fifth annual workshop on Computational learning theory, COLT '92, pp.144-152, 1992.
DOI : 10.1145/130385.130401
URL : http://www.svms.org/training/BOGV92.pdf

L. Breiman, Bagging predictors, Machine Learning, vol.24, issue.2, pp.123-140, 1996.
DOI : 10.1007/BF00058655
URL : https://link.springer.com/content/pdf/10.1007%2FBF00058655.pdf

L. Breiman, Random Forests, Machine Learning, vol.45, issue.1, pp.5-32, 2001.
DOI : 10.1023/A:1010933404324

C. Brouard, F. d'Alché-Buc, and M. Szafranski, Semi-supervised penalized output kernel regression for link prediction, International Conference on Machine Learning (ICML-11), pp.593-600, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00654123

O. Catoni, PAC-Bayesian supervised classification: the thermodynamics of statistical learning, IMS Lecture Notes Monograph Series, vol.56, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00206119

C. Cortes, V. Kuznetsov, and M. Mohri, Ensemble methods for structured prediction, Proceedings of the 31st International Conference on Machine Learning (ICML-14), pp.1134-1142, 2014.

C. Cortes, M. Mohri, and J. Weston, A general regression framework for learning string-to-string mappings, Predicting Structured Data, pp.143-168, 2007.

C. Cortes and V. Vapnik, Support-vector networks, Machine Learning, vol.20, issue.3, pp.273-297, 1995.
DOI : 10.1007/BF00994018

T. Dietterich and G. Bakiri, Solving multiclass learning problems via error-correcting output codes, Journal of Artificial Intelligence Research, vol.2, pp.263-286, 1995.

T. G. Dietterich, Ensemble Methods in Machine Learning, Multiple Classifier Systems, pp.1-15, 2000.
DOI : 10.1007/3-540-45014-9_1

P. Domingos, Bayesian averaging of classifiers and the overfitting problem, International Conference on Machine Learning, pp.223-230, 2000.

Y. Freund and R. Schapire, A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting, Journal of Computer and System Sciences, vol.55, issue.1, pp.119-139, 1997.
DOI : 10.1006/jcss.1997.1504
URL : https://doi.org/10.1006/jcss.1997.1504

T. Gärtner, A survey of kernels for structured data, ACM SIGKDD Explorations Newsletter, vol.5, issue.1, pp.49-58, 2003.
DOI : 10.1145/959242.959248

A. Gelman, J. Carlin, H. Stern, and D. Rubin, Bayesian data analysis, 2004.

P. Germain, A. Lacasse, F. Laviolette, and M. Marchand, PAC-Bayesian learning of linear classifiers, Proceedings of the 26th Annual International Conference on Machine Learning, ICML '09, pp.353-360, 2009.
DOI : 10.1145/1553374.1553419
URL : http://www.cs.mcgill.ca/~icml2009/papers/89.pdf

P. Germain, A. Lacasse, F. Laviolette, M. Marchand, and J. Roy, Risk bounds for the majority vote: From a PAC-Bayesian analysis to a learning algorithm, Journal of Machine Learning Research, vol.16, pp.787-860, 2015.

S. Giguère, F. Laviolette, M. Marchand, and A. Rolland, PAC-Bayesian risk bounds and learning algorithms for the regression approach to structured output prediction, Advanced Structured Prediction, p.239, 2014.

D. Haussler, M. Kearns, and R. Schapire, Bounds on the sample complexity of Bayesian learning using information theory and the VC dimension, Machine Learning, vol.14, issue.1, pp.83-113, 1994.

V. Kuznetsov, M. Mohri, and U. Syed, Multi-class deep boosting, Advances in Neural Information Processing Systems, pp.2501-2509, 2014.

A. Lacasse, F. Laviolette, M. Marchand, P. Germain, and N. Usunier, PAC-Bayes bounds for the risk of the majority vote and the variance of the Gibbs classifier, Advances in Neural Information Processing Systems, pp.769-776, 2007.

J. Langford and J. Shawe-Taylor, PAC-Bayes & margins, Advances in Neural Information Processing Systems, pp.423-430, 2002.

F. Laviolette, M. Marchand, and J. Roy, From PAC-Bayes bounds to quadratic programs for majority votes, International Conference on Machine Learning, pp.649-656, 2011.

L. Li, B. Zou, Q. Hu, X. Wu, and D. Yu, Dynamic classifier ensemble using classification confidence, Neurocomputing, vol.99, pp.581-591, 2013.
DOI : 10.1016/j.neucom.2012.07.026

T. Liu and D. Tao, Classification with Noisy Labels by Importance Reweighting, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.38, issue.3, pp.447-461, 2016.
DOI : 10.1109/TPAMI.2015.2456899
URL : http://arxiv.org/pdf/1411.7718

D. McAllester, Some PAC-Bayesian theorems, Proceedings of the eleventh annual conference on Computational learning theory, COLT '98, pp.355-363, 1999.
DOI : 10.1145/279943.279989

D. McAllester, Simplified PAC-Bayesian Margin Bounds, pp.203-215, 2003.
DOI : 10.1007/978-3-540-45167-9_16

D. McAllester, Generalization bounds and consistency for structured labeling, Predicting Structured Data, pp.247-262, 2009.

E. Morvant, A. Habrard, and S. Ayache, Majority Vote of Diverse Classifiers for Late Fusion, IAPR Joint International Workshops on Statistical Techniques in Pattern Recognition and Structural and Syntactic Pattern Recognition, pp.153-162, 2014.
DOI : 10.1007/978-3-662-44415-3_16
URL : https://hal.archives-ouvertes.fr/hal-00985839

E. Morvant, S. Koço, and L. Ralaivola, PAC-Bayesian generalization bound on confusion matrix for multi-class classification, International Conference on Machine Learning, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00674847

Y. Mroueh, T. Poggio, L. Rosasco, and J. Slotine, Multiclass learning with simplex coding, Advances in Neural Information Processing Systems, pp.2789-2797, 2012.

I. Mukherjee and R. Schapire, A theory of multiclass boosting, Journal of Machine Learning Research, vol.14, issue.1, pp.437-497, 2013.

F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion et al., Scikit-learn: Machine learning in Python, Journal of Machine Learning Research, vol.12, pp.2825-2830, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00650905

M. Re and G. Valentini, Ensemble Methods, pp.563-582, 2012.
DOI : 10.1201/b11822-34

J. Read, B. Pfahringer, G. Holmes, and E. Frank, Classifier chains for multi-label classification, Machine Learning, vol.85, issue.3, pp.333-359, 2011.
DOI : 10.1007/s10994-011-5256-5
URL : https://link.springer.com/content/pdf/10.1007%2F978-3-642-04174-7_17.pdf

R. Schapire and Y. Singer, Improved boosting algorithms using confidence-rated predictions, Proceedings of the eleventh annual conference on Computational learning theory, COLT '98, pp.80-91, 1999.
DOI : 10.1145/279943.279960
URL : http://www.iro.umontreal.ca/~kegl/ift3390/2006_1/Lectures/l08_ConfidenceRatedAdaBoostSchapireSinger.pdf

M. Seeger, PAC-Bayesian generalisation error bounds for Gaussian process classification, Journal of Machine Learning Research, vol.3, pp.233-269, 2003.
DOI : 10.1162/153244303765208377

Y. Seldin and N. Tishby, PAC-Bayesian analysis of co-clustering and beyond, Journal of Machine Learning Research, vol.11, pp.3595-3646, 2010.

S. Sun, A survey of multi-view machine learning, Neural Computing and Applications, vol.23, issue.7-8, pp.2031-2038, 2013.
DOI : 10.1007/s00521-013-1362-6

G. Tsoumakas and I. Vlahavas, Random k-Labelsets: An Ensemble Method for Multilabel Classification, European Conference on Machine Learning, pp.406-417, 2007.
DOI : 10.1007/978-3-540-74958-5_38
URL : https://link.springer.com/content/pdf/10.1007%2F978-3-540-74958-5_38.pdf

J. Yu, Y. Rui, and D. Tao, Click prediction for web image reranking using multimodal sparse coding, IEEE Transactions on Image Processing, vol.23, issue.5, pp.2019-2032, 2014.

Y. Zhang and J. Schneider, Maximum margin output coding, International Conference on Machine Learning, pp.1575-1582, 2012.

J. Zhu, H. Zou, S. Rosset, and T. Hastie, Multi-class AdaBoost, Statistics and its Interface, vol.2, issue.3, pp.349-360, 2009.