D. D. Aha and . Kibler, Instance-based learning algorithms, Machine Learning, vol.57, issue.1, pp.37-66, 1991.
DOI : 10.1007/BF00153759

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.138.635

M. Attik, J. Lamirel, and &. S. Shehabi, Clustering analysis for data with multiple labels, Proceedings of the IASTED International Conference on Databases and Applications (DBA), 2006.
URL : https://hal.archives-ouvertes.fr/hal-00104603

R. Battiti, Using mutual information for selecting features in supervised neural net learning, IEEE Transactions on Neural Networks, vol.5, issue.4, pp.537-550, 1994.
DOI : 10.1109/72.298224

L. Breiman, Random forests, Machine Learning, vol.45, issue.1, pp.5-32, 2001.
DOI : 10.1023/A:1010933404324

V. Bolón-canedo, N. Sánchez-maroño, and &. A. Alonso-betanzos, A review of feature selection methods on synthetic data, Knowledge and Information Systems, vol.97, issue.1, pp.1-37, 2012.
DOI : 10.1007/s10115-012-0487-8

L. Breiman, J. H. Friedman, R. A. Olshen, and &. C. Stone, Classification and Regression Trees, BRE, vol.84, 1984.

I. Cohen, T. X. Qi, Z. Sean, S. Xiang, T. Zhou et al., Feature Selection Using Principal Feature Analysis, 2002.

C. Nello, H. Lodhi, and &. J. Shawe-taylor, Latent Semantic Kernels for Feature Selection, 2000.

I. Falk, C. Gardent, and &. Lamirel, Classifying French Verbs using French and English Lexical Resources Proceedings of ACL 2012, 2012.

G. Forman, An extensive empirical study of feature selection metrics for text classification, The Journal of Machine Learning Research, vol.3, pp.1289-1305, 2003.

M. Ghribi, P. Cuxac, J. Lamirel, and &. A. Lelu, Mesures de qualité de clustering de documents : Prise en compte de la distribution des mots-clés, Proceedings of the 10th International Francophone Conference on Knowledge Extraction and Management, 2010.

I. Guyon, J. Weston, S. Barnhill, and &. V. Vapnik, Gene selection for cancer classification using support vector machines, Machine learning 46, pp.389-422, 2002.

I. A. Guyon and . Elisseeff, An introduction to variable and feature selection, The Journal of Machine Learning Research, vol.3, pp.1157-1182, 2003.

H. Hall and M. A. Smith, Feature Selection for Machine Learning: Comparing a Correlation- Based Filter Approach to the Wrapper, Proceedings of the Twelfth International Florida Artificial Intelligence Research Society Conference, pp.235-239, 1999.

K. Hajlaoui, P. Cuxac, J. Lamirel, and &. C. François, Enhancing Patent Expertise through Automatic Matching with Scientific Papers, Discovery Science LNCS, vol.7569, pp.299-312, 2012.
DOI : 10.1007/978-3-642-33492-4_24

URL : https://hal.archives-ouvertes.fr/hal-00962386

. Kee, S. Keerthi, S. Shevade, C. Bhattacharyya, and &. K. Murthy, Improvements to platt's smo algorithm for svm classifier design, Neural Computation, vol.13, issue.3, pp.637-649, 2001.

R. G. Kohavi and . John, Wrappers for feature subset selection, Artificial Intelligence, vol.97, issue.1-2, pp.273-324, 1997.
DOI : 10.1016/S0004-3702(97)00043-X

I. Kononenko, Estimating attributes: Analysis and extensions of RELIEF, European Conference on Machine Learning, pp.171-182, 1994.
DOI : 10.1007/3-540-57868-4_57

L. T. Ladha and . Deepa, Feature selection methods and algorithms, International Journal on Computer Science and Engineering, vol.3, issue.5, pp.1787-1797, 2011.

S. R. Lallich and . Rakotomalala, Fast Feature Selection Using Partial Correlation for Multivalued Attributes, Principles of Data Mining and Knowledge Discovery, édité par Djamel A. Zighed, 1910.

J. Lamirel, S. Shehabi, C. Francois, and &. M. Hoffmann, New classification quality estimators for analysis of documentary information: Application to patent analysis and web mapping, Scientometrics, vol.60, issue.3, p.60, 2004.
DOI : 10.1023/B:SCIE.0000034386.05278.e8

URL : https://hal.archives-ouvertes.fr/hal-00105509

. Lam, J. Lamirel, M. Ghribi, and &. P. Cuxac, Unsupervised recall and precision measures: a step towards new efficient clustering quality indexes, Proceedings of the 19th International Conference on Computational Statistics (COMPSTAT'2010), 2010.

. Lam, J. Lamirel, N. Priyankar, P. Cuxac, and &. G. Safi, Mining research topics evolving over time using a diachronic multi-source approach, Proceedings of ICDM 2010 International Workshop on Mining Multiple Information Sources, 2010.

J. Lamirel, R. Mall, P. Cuxac, and &. G. Safi, A New Efficient and Unbiased Approach for Clustering Quality Evaluation, Proceedings of PAKDD 2010 2nd International Workshop on Quality Issues, Measures of Interestingness and Evaluation of Data Mining Models (QIMIE), 2011.
DOI : 10.1007/978-3-642-28320-8_18

URL : https://hal.archives-ouvertes.fr/hal-00955498

. Lam, J. Lamirel, R. Mall, P. Cuxac, and &. G. Safi, Variations to incremental growing neural gas algorithm based on label maximization, Proceedings of IJCNN 2011, 2011.

. Lam and J. Lamirel, A new approach for automatizing the analysis of research topics dynamics: application to optoelectronics research, Scientometrics, vol.93, pp.151-166, 2012.

. Lam, J. Lamirel, and &. D. Reymond, Automatic websites classification and retrieval using websites communication signatures, th International Conference on WIS and 13 th Collnet Meeting, 2012.

M. , E. Sucar, and &. G. Arroyo, Feature selection with a perceptron neural net. Feature Selection for Data Mining: Interfacing Machine Learning and Statistics, p.131, 2006.

J. Novakovic, Using Information Gain Attribute Evaluation to Classify Sonar Targets, International Journal of Image Processing, 2008.

H. Peng, F. Long, and &. C. Ding, Feature selection based on mutual information criteria of maxdependency , max-relevance, and min-redundancy. Pattern Analysis and Machine Intelligence, IEEE Transactions on, vol.27, issue.8, pp.1226-1238, 2005.

R. Quinlan, C4.5: Programs for Machine Learning, 1993.

R. S. Rakotomalala and . Lallich, Construction d'arbres de d??cision par optimisation, Revue Extraction des Connaissances et Apprentissage, pp.685-703, 2002.
DOI : 10.3166/ria.16.685-703

G. Salton, Automatic processing of foreign language documents, 1971.

G. C. Salton and . Buckley, Term-weighting approaches in automatic text retrieval, Information Processing & Management, vol.24, issue.5, pp.513-523, 1988.
DOI : 10.1016/0306-4573(88)90021-0

H. Schmid, Probabilistic part-of-speech tagging using decision trees, Proceedings of International Conference on New Methods in Language Processing, 1994.

I. H. Witten and . Frank, Data mining, ACM SIGMOD Record, vol.31, issue.1, 2005.
DOI : 10.1145/507338.507355

F. T. Zhang and . Oles, Text categorization based on regularized linear classification methods, Information Retrieval, vol.4, issue.1, pp.5-31, 2001.
DOI : 10.1023/A:1011441423217

J. Zhong, D. Xiongbing, L. Jie, L. Xue, and &. L. Chuanwei, A Novel Chinese Text Feature Selection Method Based on Probability Latent Semantic Analysis, Advances in Neural Network, 2010.
DOI : 10.1007/978-3-642-13318-3_35