E. Fix and J. L. Hodges, Discriminatory Analysis. Nonparametric Discrimination: Consistency Properties, International Statistical Review / Revue Internationale de Statistique, vol.57, issue.3, 1951.
DOI : 10.2307/1403797

R. Bajcsy and S. Kova?i?, Multiresolution elastic matching, Computer Vision, Graphics, and Image Processing, vol.46, issue.1, pp.1-21, 1989.
DOI : 10.1016/S0734-189X(89)80014-3

G. E. Hinton, C. K. Williams, and M. D. Revow, Adaptative elastic models for hand-printed character recognition, Advances in Neural Information Processing Systems, pp.512-519, 1992.

H. Schwenk and M. Milgram, Transformation invariant autoassociation with application to handwritten character recognition, Advances in Neural Information Processing Systems, pp.992-998, 1995.

C. M. Bishop, Neural Networks for Pattern Recognition, 1995.

C. J. Burges, A tutorial on support vector machines for pattern recognition, Data Mining and Knowledge Discovery, vol.2, issue.2, pp.121-167, 1998.
DOI : 10.1023/A:1009715923555

R. A. Jacobs, M. I. Jordan, S. J. Nowlan, and G. E. Hinton, Adaptive Mixtures of Local Experts, Neural Computation, vol.4, issue.1, pp.79-87, 1991.
DOI : 10.1162/neco.1989.1.2.281

S. B. Ronan-collobert and Y. Bengio, A Parallel Mixture of SVMs for Very Large Scale Problems, Neural Computation, vol.20, issue.5, pp.1105-1114, 2002.
DOI : 10.1162/neco.1991.3.1.79

M. K. Titsias and A. Likas, Mixture of Experts Classification Using a Hierarchical Mixture Model, Neural Computation, vol.58, issue.9, pp.2221-2244, 2002.
DOI : 10.1214/aos/1176346060

S. Akaho and H. J. Kappen, Nonmonotonic Generalization Bias of Gaussian Mixture Models, Neural Computation, vol.39, issue.6, pp.1411-1427, 2000.
DOI : 10.1103/PhysRevLett.65.945

I. Ulusoy and C. M. Bishop, Generative versus Discriminative Methods for Object Recognition, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), pp.258-265, 2005.
DOI : 10.1109/CVPR.2005.167

G. Bouchard and B. Triggs, The tradeoff between generative and discriminative classifiers, Proceedings in Computational Statistics, 16th Symposium of IASC Prague: Physica-Verlag, 2004.
URL : https://hal.archives-ouvertes.fr/inria-00548546

C. M. Julia, A. Lasserre, and T. P. Minka, Principled hybrids of generative and discriminative models, EEE Computer Society Conference on Computer Vision and Pattern Recognition, pp.87-94, 2006.

E. Anquetil, B. Coüasnon, and F. Dambreville, A Symbol Classifier Able to Reject Wrong Shapes for Document Recognition Systems, Graphics Recognition, Recent Advances, pp.209-218, 2000.
DOI : 10.1007/3-540-40953-X_17

C. S. Abou-moustafa and M. Cheriet, A generative-discriminative hybrid for sequential data classification, IEEE International Conference on Acoustics, Speech, and Signal Processing, pp.805-808, 2004.

K. R. Ianakiev and V. Govindaraju, Improvement of recognition accuracy using 2-stage classification, Proc. of the Seventh International Workshop on Frontiers in Handwriting Recognition, L. Schomaker and L. Vuurpijl, pp.153-165, 2000.

N. Giusti, F. Masulli, and A. Sperduti, Theoretical and experimental analysis of a two-stage system for classification, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.7, pp.893-904, 2002.
DOI : 10.1109/TPAMI.2002.1017617

E. Alpaydin, C. Kaynak, and F. Alimo?-glu, Cascading multiple classifiers and representations for optical and pen-based handwritten digit recognition, Proc. of the 7th International Workshop on Frontiers in Handwriting Recognition, pp.453-462, 2000.

L. G. Vuurpijl and L. R. Schomaker, Two-stage character classification: A combined approach of clustering and support vector classifiers, Proc. of the Seventh International Workshop on Frontiers in Handwriting Recognition, L. Schomaker and L. Vuurpijl, pp.423-432, 2000.

L. Prevost, A. Moises, C. Michel-sendis, L. Oudot, and M. Milgram, Combining model-based and discriminative classifiers: application to handwritten character recognition, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings., pp.31-35, 2003.
DOI : 10.1109/ICDAR.2003.1227623

C. Hsu and C. Lin, A comparison of methods for multi-class support vector machines, 2001.

E. Mayoraz and E. Alpaydin, Support vector machines for multi-class classification, Proceedings of the International Workshop on Artifical Neural Networks (IWANN99), pp.833-842, 1999.
DOI : 10.1007/BFb0100551

D. M. Tax and R. P. Duin, Using two-class classifiers for multiclass classification, Object recognition supported by user interaction for service robots, pp.124-127, 2002.
DOI : 10.1109/ICPR.2002.1048253

H. Mouchre and E. Anquetil, A Unified Strategy to Deal with Different Natures of Reject, 18th International Conference on Pattern Recognition (ICPR'06), pp.792-795, 2006.
DOI : 10.1109/ICPR.2006.193

T. Kohonen, The self-organizing map, Proceedings of the IEEE, vol.78, issue.9, pp.1464-1480, 1990.
DOI : 10.1109/5.58325

A. Dempster, N. Laird, and D. Rubin, Maximum likelihood from incomplete data via the EM algorithm, Journal of the Royal Statistical Society, Series B, vol.39, issue.1, pp.1-38, 1977.

P. Mitra, C. Murthy, and S. K. , Density-based multiscale data condensation, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.6, pp.734-747, 2002.
DOI : 10.1109/TPAMI.2002.1008381

R. Krishnapuram and J. M. Keller, A possibilistic approach to clustering, IEEE Transactions on Fuzzy Systems, vol.1, issue.2, pp.98-110, 1993.
DOI : 10.1109/91.227387

R. Krishnapuram, Generation of membership functions via possibilistic clustering, Proceedings of 1994 IEEE 3rd International Fuzzy Systems Conference, pp.902-908, 1994.
DOI : 10.1109/FUZZY.1994.343851

R. Krishnapuram and J. M. Keller, The possibilistic C-means algorithm: insights and recommendations, IEEE Transactions on Fuzzy Systems, vol.4, issue.3, pp.385-393, 1996.
DOI : 10.1109/91.531779

X. L. Xie and G. Beni, A validity measure for fuzzy clustering, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.13, issue.8, pp.841-847, 1991.
DOI : 10.1109/34.85677

C. Z. Janikow, Fuzzy decision trees: issues and methods, IEEE Transactions on Systems, Man and Cybernetics, Part B (Cybernetics), vol.28, issue.1, pp.1-14, 1998.
DOI : 10.1109/3477.658573

C. Marsala and B. Bouchon-meunier, Choice of a method for the construction of fuzzy decision trees, The 12th IEEE International Conference on Fuzzy Systems, 2003. FUZZ '03., pp.584-589, 2003.
DOI : 10.1109/FUZZ.2003.1209429

C. Olaru and L. Wehenkel, A complete fuzzy decision tree technique Fuzzy Sets and Systems, pp.221-254, 2003.

J. Y. Hsu and I. Chiang, Fuzzy classification trees, Ninth International Symposium on Artificial Intelligence in Joint Cooperation with the Sixth International Conference on Industrial Fuzzy Control and Intelligent Systems, pp.431-439, 1996.

Y. Yuan and M. J. Shaw, Induction of fuzzy decision trees Fuzzy sets and systems, pp.125-139, 1995.

N. Ragot and E. Anquetil, A new hybrid learning method for fuzzy decision trees, 10th IEEE International Conference on Fuzzy Systems. (Cat. No.01CH37297), pp.1380-1383, 2001.
DOI : 10.1109/FUZZ.2001.1008915

URL : https://hal.archives-ouvertes.fr/hal-01191729

J. C. Bezdek, Pattern recognition with fuzzy objective function algorithms, 1981.
DOI : 10.1007/978-1-4757-0450-1

H. Tanaka, T. Okuda, and K. Asai, Fuzzy information and decision in statistical model, Advances in Fuzzy Set Theory and Applications, pp.303-320, 1979.

C. Marsala and B. Bouchon-meunier, Measures of discrimination for the construction of fuzzy decision trees, Proc. of Fuzzy Information Processing, pp.709-714, 2003.

E. Anquetil and G. Lorette, Automatic generation of hierarchical fuzzy classification systems based on explicit fuzzy rules deduced from possibilistic clustering: Application to on-line handwritten character recognition, Information Processing and Management of Uncertainty in Knowledge-Based Systems (IPMU'96), pp.259-264, 1996.

D. Bahler and L. Navarro, Methods for combining heterogeneous sets of classifiers, Proc. of the 7th National Conference on Artificial Intelligence Workshop on New Research Problems for Machine Learning, 2000.

D. M. Tax, M. Van-breukelen, R. P. Duin, and J. Kittler, Combining multiple classifiers by averaging or by multiplying?, Pattern Recognition, vol.33, issue.9, pp.1475-1485, 2000.
DOI : 10.1016/S0031-3203(99)00138-7

I. Bloch, Information combination operators for data fusion: a comparative review with classification, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans, vol.26, issue.1, pp.52-67, 1996.
DOI : 10.1109/3468.477860

J. Kittler, M. Hatef, R. P. Duin, and J. Matas, On combining classifiers, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.20, issue.3, pp.226-239, 1998.
DOI : 10.1109/34.667881

R. Collobert and S. Bengio, SVMTorch: Support vector machines for large-scale regression problems, Journal of Machine Learning Research, vol.1, pp.143-160, 2001.

C. Viard-gaudin, P. M. Lallican, S. Knerr, and P. Binter, The IRESTE On/Off (IRONOFF) dual handwriting database, Proceedings of the Fifth International Conference on Document Analysis and Recognition. ICDAR '99 (Cat. No.PR00318), pp.455-458, 1999.
DOI : 10.1109/ICDAR.1999.791823

T. Lim, W. Loh, and Y. Shih, A comparison of prediction accuracy, complexity, and training time of thirtythree old and new classification algorithms, Machine Learning, pp.203-228, 2000.