G. Bakır, T. Hofmann, B. Schölkopf, A. J. Smola, B. Taskar et al., Predicting structured outputs, 2007.

A. Bordes and L. Bottou, The Huller: A Simple and Efficient Online SVM, Machine Learning: ECML 2005, pp.505-512, 2005.
DOI : 10.1007/11564096_48
URL : https://hal.archives-ouvertes.fr/hal-00752501

A. Bordes, S. Ertekin, J. Weston, and L. Bottou, Fast kernel classifiers with online and active learning, Journal of Machine Learning Research, vol.6, pp.1579-1619, 2005.
URL : https://hal.archives-ouvertes.fr/hal-00752361

M. Collins, Discriminative training methods for hidden Markov models, Proceedings of the ACL-02 conference on Empirical methods in natural language processing, EMNLP '02, pp.1-8, 2002.
DOI : 10.3115/1118693.1118694

K. Crammer and Y. Singer, On the algorithmic implementation of multiclass kernel-based vector machines, Journal of Machine Learning Research, vol.2, pp.265-292, 2001.

K. Crammer and Y. Singer, Ultraconservative Online Algorithms for Multiclass Problems, Journal of Machine Learning Research, vol.3, pp.951-991, 2003.
DOI : 10.1007/3-540-44581-1_7

L. Denoyer and P. Gallinari, The XML document mining challenge, Advances in XML Information Retrieval and Evaluation, 5th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX, Schloß Dagstuhl, 2006.

Y. Freund and R. E. Schapire, Large margin classification using the perceptron algorithm, Proceedings of the eleventh annual conference on Computational learning theory, COLT '98, 1998.
DOI : 10.1145/279943.279985

T. Graepel, R. Herbrich, and R. C. Williamson, From margin to sparsity, Advances in neural information processing systems, pp.210-216, 2000.

C. Hildreth, A quadratic programming procedure, Naval Research Logistics Quarterly, vol.4, issue.1, pp.79-85, 1957.
DOI : 10.1002/nav.3800040113

C. Hsu and C. Lin, A comparison of methods for multi-class support vector machines, IEEE Transactions on Neural Networks, vol.13, pp.415-425, 2002.

Y. LeCun, S. Chopra, R. Hadsell, F. J. Huang, M. Ranzato et al., A tutorial on energy-based learning, pp.192-241, 2007.

J. Platt, Fast training of support vector machines using sequential minimal optimization, Advances in Kernel Methods: Support Vector Learning, pp.185-208, 1999.

R. M. Rifkin and A. Klautau, In defense of one-vs-all classification, Journal of Machine Learning Research, vol.5, pp.101-141, 2004.

B. Schölkopf and A. J. Smola, Learning with kernels, 2002.

B. Taskar, Learning structured prediction models: A large margin approach, PhD thesis, Stanford University, 2004.

B. Taskar, V. Chatalbashev, D. Koller, and C. Guestrin, Learning structured prediction models, Proceedings of the 22nd international conference on Machine learning, ICML '05, pp.896-903, 2005.
DOI : 10.1145/1102351.1102464

I. Tsochantaridis, T. Joachims, T. Hofmann, and Y. Altun, Large margin methods for structured and interdependent output variables, Journal of Machine Learning Research, vol.6, pp.1453-1484, 2005.

J. Weston and C. Watkins, Multi-class support vector machines, 1998.