Resampling methods for parameter-free and robust feature selection with mutual information - HAL open archive
Journal article, Neurocomputing, Year: 2007

Resampling methods for parameter-free and robust feature selection with mutual information

Abstract

Combining the mutual information criterion with a forward feature selection strategy offers a good trade-off between the optimality of the selected feature subset and computation time. However, it requires setting the parameter(s) of the mutual information estimator and deciding when to halt the forward procedure. These two choices are difficult to make because, as the dimensionality of the subset increases, the estimation of the mutual information becomes less and less reliable. This paper proposes to use resampling methods, namely K-fold cross-validation and the permutation test, to address both issues. The resampling methods provide information about the variance of the estimator, which can then be used to set the parameter automatically and to calculate a threshold for stopping the forward procedure. The procedure is illustrated on a synthetic dataset as well as on real-world examples.
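
The stopping rule described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes discrete features, a simple plug-in mutual information estimate standing in for whatever estimator the paper uses, and a permutation test that shuffles only the candidate feature's column. The function names (mutual_information, forward_select, make_toy_data), the 95% quantile, and the toy data are all hypothetical choices made for illustration. The K-fold step for setting the estimator parameter is not shown, because the plug-in estimate has no parameter to tune.

```python
import math
import random
from collections import Counter


def mutual_information(xs, ys):
    """Plug-in estimate of I(X;Y) in nats for two discrete samples."""
    n = len(xs)
    joint = Counter(zip(xs, ys))
    px, py = Counter(xs), Counter(ys)
    return sum((c / n) * math.log(c * n / (px[x] * py[y]))
               for (x, y), c in joint.items())


def joint_mi(columns, target):
    """MI between a group of feature columns (treated as one variable) and the target."""
    return mutual_information(list(zip(*columns)), target)


def forward_select(features, target, rng, n_perm=200, quantile=0.95):
    """Greedy forward search that stops when the best candidate fails a permutation test.

    Assumption: the null distribution is built by shuffling the candidate
    column only, so it reflects the MI (including estimator bias) obtained
    when the candidate carries no information.
    """
    selected, remaining = [], list(range(len(features)))
    while remaining:
        base = [features[i] for i in selected]
        scores = {j: joint_mi(base + [features[j]], target) for j in remaining}
        best = max(scores, key=scores.get)

        shuffled = list(features[best])
        null = []
        for _ in range(n_perm):
            rng.shuffle(shuffled)
            null.append(joint_mi(base + [shuffled], target))
        null.sort()
        threshold = null[min(n_perm - 1, int(quantile * n_perm))]

        if scores[best] <= threshold:
            break  # the gain is not distinguishable from chance: halt
        selected.append(best)
        remaining.remove(best)
    return selected


def make_toy_data(n, rng):
    """Five binary features; only features 0 and 1 drive the (noisy) target."""
    features = [[rng.randint(0, 1) for _ in range(n)] for _ in range(5)]
    target = []
    for a, b in zip(features[0], features[1]):
        y = a + 2 * b
        if rng.random() < 0.1:  # 10% of the labels are replaced at random
            y = rng.randint(0, 3)
        target.append(y)
    return features, target


rng = random.Random(0)
features, target = make_toy_data(400, rng)
selected = forward_select(features, target, rng)
print(selected)
```

On this toy problem the two informative features should be the first two selected. Whether a noise feature is occasionally accepted afterwards depends on the random seed and the chosen quantile, since the test is applied to the best of several candidates at each step.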
Main file
permtest.pdf (239.98 KB)
Origin: files produced by the author(s)

Dates and versions

inria-00174298, version 1 (23-09-2007)

Identifiers

HAL Id: inria-00174298
DOI: 10.1016/j.neucom.2006.11.019

Cite

Damien François, Fabrice Rossi, Vincent Wertz, Michel Verleysen. Resampling methods for parameter-free and robust feature selection with mutual information. Neurocomputing, 2007, 70 (7-9), pp.1276-1288. ⟨10.1016/j.neucom.2006.11.019⟩. ⟨inria-00174298⟩

Collections

INRIA INRIA2
153 views
246 downloads