I. Guyon and A. Elisseeff, An introduction to variable and feature selection, Journal of Machine Learning Research, vol.3, pp.1157-1182, 2003.

A. Verikas and M. Bacauskiene, Feature selection with neural networks, Pattern Recognition Letters, vol.23, issue.11, pp.1323-1335, 2002.
DOI : 10.1016/S0167-8655(02)00081-8

G. M. Fung and O. Mangasarian, A Feature Selection Newton Method for Support Vector Machine Classification, Computational Optimization and Applications, vol.28, issue.2, pp.185-202, 2004.
DOI : 10.1023/B:COAP.0000026884.66338.df

B. Hammer and T. Villmann, Generalized relevance learning vector quantization, Neural Networks, vol.15, issue.8-9, pp.1059-1068, 2002.
DOI : 10.1016/S0893-6080(02)00079-5

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.13.1718

G. Van Dijck and M. M. Van Hulle, Speeding up the wrapper feature subset selection in regression by mutual information relevance and redundancy analysis, ICANN 2006: International Conference on Artificial Neural Networks, 2006.

F. Rossi, A. Lendasse, D. François, V. Wertz, and M. Verleysen, Mutual information for the selection of relevant variables in spectrometric nonlinear modelling, Chemometrics and Intelligent Laboratory Systems, vol.80, issue.2, pp.215-226, 2006.
DOI : 10.1016/j.chemolab.2005.06.010

URL : https://hal.archives-ouvertes.fr/inria-00174077

D. Scott, Multivariate Density Estimation: Theory, Practice, and Visualization, 1992.

R. Bellman, Adaptive Control Processes: A Guided Tour, 1961.
DOI : 10.1515/9781400874668

B. V. Bonnlander and A. S. Weigend, Selecting input variables using mutual information and nonparametric density estimation, Proc. of the, pp.42-50, 1994.

N. Kwak and C. Choi, Input feature selection by mutual information based on Parzen window, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.24, issue.12, pp.1667-1671, 2002.
DOI : 10.1109/TPAMI.2002.1114861

A. Kraskov, H. Stögbauer, and P. Grassberger, Estimating mutual information, Physical Review E, vol.69, issue.6, p.066138, 2004.
DOI : 10.1103/PhysRevE.69.066138

R. Battiti, Using mutual information for selecting features in supervised neural net learning, IEEE Transactions on Neural Networks, vol.5, issue.4, pp.537-550, 1994.
DOI : 10.1109/72.298224

N. Kwak and C. Choi, Input feature selection for classification problems, IEEE Transactions on Neural Networks, vol.13, issue.1, pp.143-159, 2002.
DOI : 10.1109/72.977291

F. Fleuret, Fast binary feature selection with conditional mutual information, Journal of Machine Learning Research, vol.5, pp.1531-1555, 2004.

P. Good, Permutation Tests, 1994.

J. Opdyke, Fast Permutation Tests that Maximize Power Under Conventional Monte Carlo Sampling for Pairwise and Multiple Comparisons, Journal of Modern Applied Statistical Methods, vol.2, issue.1, pp.27-49, 2003.
DOI : 10.22237/jmasm/1051747500

R. Craddock, R. Taylor, G. Broderick, T. Whistler, N. Klimas et al., Exploration of statistical dependence between illness parameters using the entropy correlation coefficient, Pharmacogenomics, vol.7, issue.3, pp.421-428, 2006.
DOI : 10.2217/14622416.7.3.421

P. T. Hahn, J. Rahnenführer, and T. Lengauer, Confirmation of human protein interaction data by human expression data, BMC Bioinformatics, vol.6, issue.112, 2005.

J. Hummel, N. Keshvari, W. Weckwerth, and J. Selbig, Species-specific analysis of protein sequence motifs using mutual information, BMC Bioinformatics, vol.6, issue.164, 2005.

G. Purushothaman and D. C. Bradley, Neural population code for fine perceptual decisions in area MT, Nature Neuroscience, vol.8, issue.1, pp.99-106, 2005.
DOI : 10.1038/nn1373

N. Hoffman, C. Schiffer, and R. Swanstrom, Covariation of amino acid positions in HIV-1 protease, Virology, vol.314, issue.2, pp.536-548, 2003.
DOI : 10.1016/S0042-6822(03)00484-7

C. Diks and S. Manzan, Tests for Serial Independence and Linearity Based on Correlation Integrals, Studies in Nonlinear Dynamics & Econometrics, vol.6, issue.2, 2002.
DOI : 10.2202/1558-3708.1005

C. Conrad, H. Erfle, P. Warnat, N. Daigle, T. Lorch et al., Automatic Identification of Subcellular Phenotypes on Human Cell Arrays, Genome Research, vol.14, issue.6, pp.1130-1136, 2004.
DOI : 10.1101/gr.2383804

P. Radivojac, Z. Obradovic, K. Dunker, and S. Vucetic, Feature Selection Filters Based on the Permutation Test, Machine Learning: ECML 2004, 15th European Conference on Machine Learning, pp.334-346, 2004.
DOI : 10.1007/978-3-540-30115-8_32

E. Frank and I. H. Witten, Using a permutation test for attribute selection in decision trees, Proc. 15th International Conf. on Machine Learning, pp.152-160, 1998.

D. François, V. Wertz, and M. Verleysen, The permutation test for feature selection by mutual information, European Symposium on Artificial Neural Networks, pp.239-244, 2006.

X. Zhou, X. Wang, E. Dougherty, and D. R. Suh, Gene Clustering Based on Clusterwide Mutual Information, Journal of Computational Biology, vol.11, issue.1, pp.147-161, 2004.
DOI : 10.1089/106652704773416939

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.2.4385

N. Nicolaou and S. J. Nasuto, Mutual information for EEG analysis, Proc. 4th IEEE EMBSS UKRI Postgraduate Conference on Biomedical Engineering and Medical Physics (PGBIOMED'05), pp.23-24, 2005.

J. Friedman, Multivariate adaptive regression splines (with discussion), Annals of Statistics, vol.19, issue.1, pp.1-141, 1991.
DOI : 10.1214/aos/1176347963

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.382.970

N. Benoudjit and M. Verleysen, On the kernel widths in radial-basis function networks, Neural Processing Letters, vol.18, issue.2, pp.139-154, 2003.
DOI : 10.1023/A:1026289910256

F. Rossi, N. Delannay, B. Conan-Guez, and M. Verleysen, Representation of functional data in neural networks, Neurocomputing, vol.64, pp.183-210, 2005.
DOI : 10.1016/j.neucom.2004.11.012

URL : https://hal.archives-ouvertes.fr/inria-00000666

K. Hild, D. Erdogmus, and J. Principe, Blind source separation using Renyi's mutual information, IEEE Signal Processing Letters, vol.8, issue.6, pp.174-176, 2001.
DOI : 10.1109/97.923043

A. Stefansson, N. Koncar, and A. J. Jones, A note on the Gamma test, Neural Computing & Applications, vol.5, issue.3, pp.131-133, 1997.
DOI : 10.1007/BF01413858